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Abstract 



- -a -rRsent invention relates to novel •dennfiec polynucleotides polypeptides encoded by i^er 
■;~c ;r; tne use c' sue" ooiy nucleotides anc polypeptides, and to their production More 
f; p.a~ cuiarty the poiynucieotiaes ana polypeptides of the present invention relate to the G-protem 
co'jp:ec receptor family, referred to as IGS4-family. Tne invention aiso rebates to inhibiting or 
act valine the action of such, polynucleotides and polypeptides, to a vector containing said 
polynucleotides, a host cell containing such vector and transgenic animals where the IGS4-gene 
is either overexpressed, misexpressed, underexpressed or suppressed (knock-out animals) 

10 The invention further relates to a method for screening compounds capable to act as an agonist 
or an antagonist of said G-protem coupled receptor family IGS4 and the use of IGS4 
polypeptides and polynucleotides and agonists or antagonists to the IGS4 receptor family >n tne 
treatment of CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) 
disorders such as oDsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD). 

15 phobia and panic, major depressive disorder, bipolar disorder. Parkinsons disease, general 
anxiety disorder, autism, delirium. Alzheimer disease/dementia, severe mental retardation and 
dyskinesias, such as Huntington s disease or GiHes dela Tourett's syndrome, anorexia, pum-a. 
stroke, adrfict'on/depenoency/craving, sleep d'sorder. epilepsy, m gra^ne, attent-u- 
cetV/.t/hyperactivty d-sorder (ADHD): cardiovascular disease including hear: fa^u'e lv-< : ; 

20 pectoris. arrhytnmias, myocsrciai infarction. nyootens;on; nypartens.on, t^.romoosis. 
dvs'ipidemias; Obesity; emesis; IBS, IED. GERD, conditions of delayed gastric emptying. SL.cn 
as postoperative or diabetic cj.-isfopp.resis; osteoporosis, infections such as bacterial, funcai. 
protozoan anc vzal infections, particularly infections caused by HIV- • or HIV 2; pain; cancers; 
.s nary retention, ulcers, asthma; aiiergios and benign prostatic hypertophy. among others anc 

25 dacnostic assays for such conditions. 
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Description 



5 The present invention relates to novel identified polynucleotides, polypeptides encoded by them 

and to the use of such polynucleotides and polypeptides, and to their production. More particularly, the 
polynucleotides and polypeptides of the present invention relate to a G-protein coupled receptor 
(GPCR), hereinafter referred to as 1GS4. IGS4 exists in two polymorphic forms, hereinafter referred to 
as IGS4A and IGS4B. The invention also relates to inhibiting or activating the action of such 
10 polynucleotides and polypeptides, to a vector containing said polynucleotides, a host cell containing 
such vector and transgenic animals where the IGS4-gene is either overexpressed, misexpressed, 
underexpressed or suppressed (knock-out animals). The invention further relates to a method for 
screening compounds capable to act as an agonist or an antagonist of said G-protein coupled receptor 
IGS4. 



BACKGROUND OF THE INVENTION 



It is well established that many medically significant biological processes are mediated by proteins 
participating in signal transduction pathways that involve G-proteins and/or second messengers; e.g., 

20 cAMP (Lefkowitz, Nature, 1991, 351:353-354). Herein these proteins are referred to as proteins 
participating in pathways with G-proteins. Some examples of these proteins include the GPC receptors, 
such as those for adrenergic agents and dopamine (Kobilka, B.K., et al., Proc. Natl. Acad. ScL, USA, 
1987, 84:46-50; Kobilka, B.K., et al., Science, 1987, 238:650-656; Bunzow, J.R., et al., Nature, 1988, 
336:783-787), G-proteins themselves, effector proteins, e.g., phospholipase C, adenyl cyclase, and 

25 phosphodiesterase, and actuator proteins, e.g., protein kinase A and protein kinase C (Simon, M.I., et 
al., Science, 1991, 252:802-8). 

^ For example, in one form of signal transduction, upon hormone binding to a GPCR the receptor 

interacts with the heterotrimeric G-protein and induces the dissociation of GDP from the guanine 

30 nucleotide-binding site. At normal cellular concentrations of guanine nucleotides, GTP fills the site 
immediately. Binding of GTP to the a subunit of the G-protein causes the dissociation of the G-protein 
from the receptor and the dissociation of the G-protein into a and Py subunits. The GTP-carrying form 
then binds to activated adenylate cyclase. Hydrolysis of GTP to GDP, catalyzed by the G-protein itself 
(« subunit possesses an intrinsic GTPase activity), returns the G-protein to its basal, inactive form. The 

35 GTPase activity of the a subunit is, in essence, an internal clock that controls an on/off switch. The GDP 
bound form of the a subunit has high affinity for [3y and subsequent reassociation of c.GDP with fv.< 
returns the system to the basal state. Thus the G-protein serves a dual role, as an intermediate that 
relays the signal from receptor to effector (in this example adenylate cyclase), and as a clock that 
controls the duration of the signal. 
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The membrane bound superfamily of G-protein coupled receptors has been characterized as 
having seven putative transmembrane domains. The domains are believed to represent transmembrane 
cx-helices connected by extracellular or cytoplasmic loops. G-prctein coupled receptors include a wide 
5 range of biofogically active receptors, such as hormone, viral, growth factor and neuroreceptors. 

The G-protein coupled receptor family includes dopamine receptors which bind to neuroleptic 
drugs used for treating CNS disorders. Other examples of members of this family include, but are not 
limited to calcitonin, adrenergic, neuropeptideY, somastotatin, neurotensin, neurokinin, capsaicin, VIP, 
10 CGRP, CRF, CCK, bradykinin, galanin, motilin, nociceptin, endothelin, cAMP, adenosine, muscarinic, 
acetylcholine, serotonin, histamine, thrombin, kinin, follicle stimulating hormone, opsin, endothelial 
differentiation gene-1, rhodopsin, odorant, and cytomegalovirus receptors. 

Most G-protein coupled receptors have single conserved cysteine residues in each of the first two 
^5 extracellular loops which form disulfide bonds that are believed to stabilize functional protein structures. 
The 7 transmembrane regions are designated as TM1, TM2, TM3, TM4, TM5, TM6 and TM7. The 
cytoplasmic loop which connects TM5 and TM6 may be a major component of the G-protein binding 
domain. 

20 Most G-protein coupled receptors contain potential phosphorylation sites within the third 

cytoplasmic loop and/or the carboxy terminus. For several G-protein coupled receptors, such as the (3- 
adrenoreceptor, phosphorylation by protein kinase A and/or specific receptor kinases mediates receptor 
desensitization. 

25 Recently, it was discovered that certain GPCRs, like the caicitonin-receptor like receptor, might 

interact with small single pass membrane proteins called receptor activity modifying proteins (RAMP's). 
This interaction of the GPCR with a certain RAMP is determining which natural ligands have relevant 
^ affinity for the GPCR-RAMP combination and regulate the functional signaling activity of the complex 
(McLathie, L.M. eta!., Nature (1998) 393:333-339). 



For some receptors, the ligand binding sites of G-protein coupled receptors are believed to 
comprise hydrophilic sockets formed by several G-protein coupled receptor transmembrane domains, 
said sockets being surrounded by hydrophobic residues of the G-protein coupled receptors. The 
hydrophilic side of each G-protein coupled receptor transmembrane helix is postulated to face inward 
and form a polar ligand-binding site. TM3 has been implicated in several G-protein coupled receptors as 
having a ligand-bmding site, such as the TM3 aspartate residue. TM5 serines, a TM6 asparagine and 
TM5 or TM7 phenylalanines or tyrosines are also implicated in ligand binding. 
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G-protein coupled recej^^can be intracellularly coupled by heierc^Kric G-proteins to various 
intracellular enzymes, ion channels and transporters (see, Johnson et al., Endoc. Rev., 1989, 10:317- 
331). Different G-protein a-subunits preferentially stimulate particular effectors to modulate various 
biological functions in a cell. Phosphorylation of cytoplasmic residues of G-protein coupled receptors 
5 has been identified as an important mechanism for the regulation of G-protein coupling of some G- 
protein coupled receptors. G-protein coupled receptors are found in numerous sites within a mammalian 
host. 

Receptors - primarily the GPCR class - have led to more than half of the currently known drugs 
10 (Drews, Nature Biotechnology, 1996, 14: 1516). This indicates that these receptors have an established, 
proven history as therapeutic targets. Clearly there is a need for identification and characterization of 
further receptors which can play a role in preventing, ameliorating or correcting dysfunctions or 
diseases, including, but not limited to CNS disorders, including schizophrenia, episodic paroxysmal 
anxiety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder 
^5 (PTSD), phobia and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general 
™ anxiety disorder, autism, delirium, Alzheimer disease/dementia, severe mental retardation and 
dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome, anorexia, bulimia, stroke, 
addiction/ dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
20 infarction, hypotension; hypertension, thrombosis; dysiipidemias; obesity; emesis; IBS; IBD; GERD; 
conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 
infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV- 
1 or HlV-2; pain; cancers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy. 

25 

SUMMARY OF THE INVENTION 

^ In one aspect, the invention relates to IGS4 polypeptides (including the IGS4A and !GS4B 

30 polypeptide polymorphs) and recombinant materials and methods for their production. Another aspect of 
the invention relates to methods for using such IGS4 polypeptides and polynucleotides. Such uses 
include the treatment of CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) 
disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia 
and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, 
35 autism, delirium, Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as 
Huntington's disease or Gilles dela Tourett's syndrome, anorexia, bulimia. stroke, 
addiction/dependency/craving, sieep disorder, epilepsy, migraine; attention deficil'hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
infarction, hypotension; hypertension, thrombosis; dysiipidemias; obesity; emesis; IBS. IBD: GERD: 
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conditions of cleiayed gastric'emptying, such as postoperative or diabW^astroparesis; osteoporosis; 
infections such as bacterial, fungai, protozoan and viral infections, particularly infections caused by HIV- 
1 or HlV-2; pain; cancers: urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy, 
among others. 

5 In still another aspect, the invention relates to methods to identify agonists and antagonists using 

the materials provided by the invention, and treating conditions associated with IGS4 imbalance with the 
identified compounds. Yet another aspect of the invention relates to diagnostic assays for detecting 
diseases associated with inappropriate IGS4 activity or levels. A further aspect of the invention relates to 
animal-based systems which act as models for disorders arising from aberrant expression or activity of 
10 IGS4. 
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rable 1: IGS4A-DNA of SEQlBRO: 1 and SEQ ID NO: 3 



10 



20 



25 



30 



GGCTCAGCTTGAAACAGAGCCTCGTACCAGGGGAGGCTCAGGCCTTGGATTTTAATGTCA 

G GG ATGG A AAAACTT C AG AATG C TT C G T GG AT CT AC C AG C AG AAACT AG AAG AT C CAT T C 

CAGAAACACCTGAACAGCACCGAGGAGTATCTGGCCTTCCTCTGCGGACCTCGGCGCAGC 

CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 

AATGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 

TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 

GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 

ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 

GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 

GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 

AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 

ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 

CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 

CTAAAGAAAGACAAATCTCTTGAGGCAGATGAAGGGAATGCAAATATTCAAAGACCCTGC 

AGAAAATCAGTCAACAAGATGCTGTTTGTCTTGGTCTTAGTGTTTGCTATCTGTTGGGCC 

CCGTTCCACATTGACCGACTCTTCTTCAGCTTTGTGGAGGAGTGGAGTGAATCCCTGGCT 

GCTGTGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTCTTCTACCTGAGCTCAGCTGTC 

AACCCCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATC 

TCTTCTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGG 

AACATCTTCCTGACAGAATGCCACTTTGTGGAGCTGACCGAAGATATAGGTCCCCAATTC 

CCATGTCAGTCATCCATGCACA^CTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATG 

TCAAGAACAAACTATCAAAGCTTCCACTTTAACAAAACCTGAATTCTTTCAGAGCTGACT 

CTCCTCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGA 

TATTAGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTA 

ATAAACGTGAAAACTGAGAGTTAGATCTGGTTTCAAAACCCAAGACTGCCTGATTTTTAG 

TTATCTTTCCACTATCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAAC3TGA 

CTGGAAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTG 

AGT C AT CT ACG TT C C G AGT C AG G C T G T C AC TCCTACTA- 3 ' 



"able 2: IGS4E-DNA of SEQ ID NO: 5 anc SEQ ID NO: 7 



: 333A7GGAAAAACTTCAGAATGC7TCCTGGATCTACCAGCAGAAACTAGAAGATCCA I - C 

! GAG AAA 3 A C CT G AA C^GC^-.CC GAG G AGT AT C T G G C CTT C CT C T G C GG AC CT CGG C G C AG C 

! 

. CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 
1 A-.TGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 
: TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 
GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 
ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 
GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 
GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 
AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 
ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 
CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 
C T AAAG AAAG A C AAAT CT C TTG AGG C AG ATG AAG GG AATG C AAAT ATT C AAAG AC C C T G C 
AGAAAATCAGTCAACAAGATGCTGTTTGTCTTGGTCTTAGTGTTTGCTATCTGTTGGGCC 
CC3TTCCACATTGACCGACTCTTCTTCAGCTTTGTGGAGGAGTGGACTGAATCCCTGGCT 
GCTGTGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTATTCTACCTGAGCTCAGCTGTC 
AACCCCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATC 
TCTTCTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGG 
AAC ATCTTCCTGACAGAATGCCACTTTGTGG AG CTGACCG AAG ATATAGGTCCCC AATT C 
CTATGTCAGTCATCCGTGCACAACTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATG 
TCAAGAACAAACTATCAAAGCTTCCACTTTAACAAAACCTGAATTCTTTCAGAGCTGACT 
CTCCTCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGA 
AATTAGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTA 
ATAAACGTGAAAACTGAGAGTTAGATCTGGTTTCAAAACCCAAGACTGCCTGATTTTTAG 
TTATCTTTCCACTATCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAACGTGA 
CT3GAAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTG 
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Table 4: IGS4A-protein of SEQ ID NO: 2 and SEQ ID NO: 4 (without the three amino acids 
between brackets). 



I (. MS G ) ME KLQNAS W I Y Q QKLED PFQ KHLNSTEE YLAFL CGPRRSHFFLPVS VVYVP I FWGV 
I GNVLVCIiVILQHQAMKTPTNYYLFS LAVSDLLVLLLGMFLEVYEMWRNYPFLFGPVGCY 
F KTAL FETVCFAS I LS I TFvSVE R YY A I LHP FRAKLQ S TRRRALR ILGIVWGFSVLFSLP 

i NTS IHGI KFKY F PNGS L VPGS ATCTV I KPMW I YNF 1 1 QVT SFLF YLLPMTV I SVLYYLMA 
LRLKKDKSLEABEGKANIQRPCRKSVNKMLFVLVLVFAICWAPFHIDRLFFSFVEEWSES 
LAAVF^VHWSGVFFYLSSAVNPIIYmjLSRRFQAAFQNVISSFHKQWHSQHDPQLPPA 
QRNIFLTECHFVELTEDIGPQFPCQSSMHNSHLPTALSSEQMSRTNYQSFHFNKT 



Table 5: IGS4B-protein of SEQ ID NO: 6 and SEQ ID NO: 8 (without the three amino 
acids between brackets). 



(MSG)MEKLQNASWIYQQKLEDPFQKHLNSTEEYLAFLCGPRRSHFFLPVSVVYVPIFVVGV 
IGNVLVCLVILQHQAMKTPTNYYLFSLAVSDL^ 

FKTALFETVCFASILSITTVSVERY\ r AILHPFRAKiQSTRRRAiRILGIWGFSVLFSLP 
NTS IHGIKFHYFPNGSLVPGSATCTVIKPMWI YNFI I QVTSFLFYLLPMTVI SVLYYLMA 
LRLKKDKSLEADEGNANIQRPCRKSWKMLFVLVLVFAICWAPFHIDRLFFSFVEEWTES 
LAAVFNLVHWS GVL FYLSS AVN P II YNLLS RRFQAAFQNVI S S FHKQWHS QHD PQLPPA 
QRNIFLTECHFVELTEDIGPQFLCQSSVHNSKLPTALSSEQMSRTNYQSFHFNKT 
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DESCRIPTION OF THE INVENTION 




Definitions 



30 The following definitions are provided to facilitate understanding of certain terms used frequently 

herein. 

"iGS4" refers, among others, to a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO: 2 or SEQ ID NO: 4 (IGS4A) and SEQ ID NO: 6 or SEQ ID NO: 8 (IGS4B), or an allelic 
25 variant thereof. 

"Receptor Activity" or "Bicicgical Activity of the Receptor" refers to the metabolic or physiologic 
function of said IGS4 including similar activities or improved activities or these activ ties 'with decreased 
a: -desirable side effects. Also induced are antigenic and immunogenic activities of said IGS-. 



"IGS--cer,e" refers to a polynucleotide comprising the nucleotide sequence set forth In SEC ID 
NC: 1. SEC ID NO: 2, SEC ID N'C: 5 or SEC ID NO: 7 or allelic variants thereof and/or ineir 
complements. 

5 

"Antibodies" as usee herein includes polyclonal and monoclonal antibodies, chimeric, single 
chain, and humanized antibodies, as well as Fab fragments, including the products of a Fab or other 
immunoglobulin expression library. 

10 "Isolated" means altered "by the hand of man" from the natural state and/or separated from the 

natural environment. Thus, if an "isolated" composition or substance that occurs in nature has been 
"isolated", it has been changed or removed from its original environment, or both. For example, a 
polynucleotide or a polypeptide naturally present in a living animal is not "isolated," but the same 
polynucleotide or polypeptide separated from the coexisting materials of its natural state is "isolated", as 
^5 the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonucleotide, which may 
be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotides" include, without limitation 
singfe- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- 

20 and double-stranded RNA, and RNA that is a mixture of single-and doubie-stranded regions, hybrid 
molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or 
a mixture of single- and doubie-stranded regions. In addition, "polynucleotide" may also include triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. The term polynucleotide also 
includes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones 

25 modified for stability or for other reasons. "Modified" bases include, for example, tritylated bases and 
unusual bases such as inosine. A variety of modifications has been made to DNA and RNA; thus, 
"polynucleotide" embraces chemically, enzymatically or metabolically modified forms of polynucleotides 
^ as typically found in nature, as well as the chemical forms of DNA and RNA characteristic of viruses and 

' cells. "Polynucleotide" also embraces relatively short polynucleotides, often referred to as 

30 oligonucleotides. 

"Polypeptide" refers to any peptide or protein comprising two or more amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. "Polypeptide" refers to both 
shont cha.ns. commonly referred to as peptides, oligopeptides or oligomers, and to longer chains. 
35 generally referred to as proteins. Polypeptides may contain amino acics ether than the 20 gene- 
encoded ammo acids. "Polypeptides" include amino acid sequences modified either by natural 
processes, such as oositrarslaticnal processing, or by chemical modification techniques which a-e we;! 
known in the art. Such modificat.c-s a-e well-described in basic texts and in more detailed monograons. 
as woi: as :r a voluminous research literature. Modifications can occur anywhere in a poypeotido. 
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including the peptide backbo^j^e amino acic side-chains and the amino^fcarboxyl termini. It wiit be 
appreciated that the same type of modification may be present in the same or varying degrees at 
several sites in a given polypeptide. Also, a given polypeptide may contain many types of modifications. 
Polypeptides may be branched as a result of ubiquitination, and they may be cyclic, with or without 
5 branching. Cyclic, branched and branched cyclic polypeptides may result from posttranslation natural 
processes or may be made by synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosyiation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol; cross-linking, cyclization, disulfide bond formation, 
10 demethylation, formation of covalent cross-links, formation of cystine, formation of pyroglutamate, 
formylation, gamma-carboxylation, giycosylation, GPl anchor formation, hydroxylation, iodination, 
methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, 
and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
^15 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York, 1993 and Wold, F., 
™ Posttranslational Protein Modifications: Perspectives and Prospects, pgs. 1-12 in 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, 1983; Seifter et al., "Analysis for protein modifications and nonprotein cofactors", 
Meth. Enzymol. (1990) 182:626-646 and Rattan et al., "Protein Synthesis: Posttranslational 
20 Modifications and Aging", Ann. NY Acad. Sci. (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs from a 
reference polynucleotide or polypeptide respectively, but retains essential properties. A typical variant of 
a polynucleotide differs in nucleotide sequence from another, reference polynucleotide. Changes in the 

25 nucleotide sequence of the variant may or may not alter the amino acid sequence of a polypeptide 
encoded by the reference polynucleotide. Nucleotide changes may result in amino acid substitutions, 
additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as 
^ discussed below. A typical variant of a polypeptide differs in amino acid sequence from another, 
reference polypeptide. Generally, differences are limited so that the sequences of the reference 

30 polypeptide and the variant are closely similar overall and, in many regions, identical. A variant and 
reference polypeptide may differ m amino acid sequence by one or more substitutions, additions, and 
deletions in any combination. A substituted or inserted amino acid residue may or may not be one 
encoded by the cenet : c code. A variant of a polynucleotide or polypeptide may be a naturally occurring 
such as an allelic variant, or it may be a variant that is not known to occur naturally. Non-naturally 

35 occurring variants of polynucleotides and polypeptides may be made by mutagenesis techniques or by 
direct synthesis. 

"Ident::." is a measure o ; the identity of nucleotide sequences or ammo acid sequences, in 
'!'.''.'.• r i : ! . the l;u:;u are so that tue hie;: .est order match .s obtained. '"Identity" per so nut. a" 
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art-recognized meaning anT can be calculated using publisn^^ techniques. See, e.g.: 
^COMPUTATIONAL MOLECULAR BIOLOGY. Lesk, A.M., ed., Oxford University P^ess, New Ycr*, 
1958; EIOCOMPUTING: INFORMATICS AND GENOME PROJECTS. Smith, CAW, ed.; Academic 
P'ess. New York, 1993; COMPUTER ANALYSIS OF SEQUENCE DATA. FART I, Griffin, A.M., and 
5 Griffin, H.G., eds., Humana Press, New Jersey. 1994; SEQUENCE ANALYSIS IN MOLECULAR 
BIOLOGY, von Heime. G.. Academic Press, 1987; and SEQUENCE ANALYSIS PRIMER, Gribskov, M. 
and Devereux, J., ecs., M Stockton Press, New York, 1991). While there exist a number of methods to 
measure identity between two polynucleotide or polypeptide sequences, the term "identity" is well 
known to skilled artisans (Carillo, H., and Upton, D., SIAM J. Applied Math. (1988) 48:1073). Methods 
10 commonly employed to determine identity or similarity between two sequences include, but are not 
limited to, those disclosed in Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San 
Diego, 1994, and Carillo, H. f and Lipton, D., SIAM J. Applied Math. (1988) 48;1073. Methods to 
determine identity and similarity are codified in computer programs. Preferred computer program 
methods to determine identity and similarity between two sequences include, but are not limited to, 
^5 GCG program package (Devereux, J., et al., Nucleic Acids Research (1984) 12(1):387), BLASTP, 
™ BLASTN, FASTA (Atschul, S.F. et al., J. Molec. Biol. (1990) 21 5:403). 

As an illustration, by a polynucleotide having a nucleotide sequence having at least, for example, 
95% "identity" to a reference nucleotide sequence of SEQ ID NO: 1 is intended that the nucleotide 

20 sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide 
sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide 
sequence of SEQ ID NO: 1. in other words, to obtain a polynucleotide having a nucleotide sequence at 
least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference 
sequence may be celeted or substituted with another nucleotide, or a number of nucleotides up to 5% of 

25 the total nucleotides in the reference sequence may be inserted into the reference sequence, or in a 
number of nucleotides of up to 5% of the total nucleotides in the reference sequence there may be a 
combination of deletion, insertion and substitution. These mutations of the reference sequence may 
^ occur at the 5 or 3 terminal positions of the reference nucleotide sequence or anywhere between those 
terminal positions, interspersed either individually among nucleotides in the reference sequence or in 

30 one or more contiguous groups within the reference sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, for example. 95% 
"identity" to a reference amino acid sequence of SEQ ID NO: 2 is intended that the amino acid 
sequence of i h e polypeptice is identical to the reference sequence exceot that tne polypeptide 
35 sequence may include uo to fr.e ammo acid alterations per each 100 ammo acics of the reierence 
amino acid of SEG ID NO: 2. In cmer words, to obtain a polypeptide having an ammo acid sequence a- 
-rmst 95% identical to a reference amino acic secu^nce. up to 5% of the amino acid residues in me 
"e'cence sequence may oe deeted or substituted with another ammo acid, or a n^mce: of srrv^o a:: h's 
up : f; . of tne total ammo ac.;: -esiduos in the reference sequence may oe mserte'J m.to the refcorcu 
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is^^^e reference sequence may occur at tl^lfcii! 



sequence. These altera: ons^^^e reference sequence may occur- at tfllBrhino or car boxy terminal 
positions of the reference ammo acid sequence or anywhere between those termina! positions, 
interspersed either individually among residues in the reference sequence or in one or more contiguous 
groups within the reference sequence. 

5 

Polypeptides of the Invention 

In one aspect, the present invention relates to IGS4 polypeptides (or IGS4 proteins). The IGS4 
polypeptides include the polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 and SEQ ID NO: 8 

10 and the polypeptide having the amino acid sequence encoded by the DNA insert contained in the 
deposit no. CBS102221 or deposit no. CBS102221, deposited on September 24, 1999 at the 
Centraalbureau voor Schimmelcultures at Baarn the Netherlands; as well as polypeptides comprising 
the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 and the 
polypeptide having the amino acid sequence encoded by the DNA insert contained in the deposit no. 
J 5 CBS1 02221 or deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
" Netherlands and polypeptides comprising the amino acid sequence which have at least 80% identity to 
that of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 and/or the polypeptide having the 
amino acid sequence encoded by the DNA insert contained in the deposit no. CBS1 02221 or deposit no. 
CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the Netherlands over its entire 

20 length, and stiil more preferably at least 90% identity, and even still more preferably at least 95% identity 
to said amino acid sequences. Furthermore, those with at least 97%, in particular at least 99%, are 
highly preferred. Also included within IGS4 polypeptides are polypeptides having the amino acid 
sequence which have at least 80% identity to the polypeptide having the amino acid sequence of SEQ 
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 or the polypeptide having the amino acid 

25 sequence encoded by the DNA insert contained in the deposit no. CBS102221 or deposit 
no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the Netherlands over its entire 
length, and still more preferably at least 90% identity, and even still more preferably at least 95% identity 
^ to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8. Furthermore, those with at least 97%, 
in particular at least 99% are highly preferred. Preferably IGS4 polypeptides exhibit at least one 

30 biological activity of the receptor. 

The IGS^ polypeptides may be in the form of the "mature" protein or may be a part of a larger 
protein such as a fusion protein. It is often advantageous to include an additional amino acid sequence 
which contains secretory or leader sequences, pro-sequences, sequences which aid in purification 
25 such as multiple histidine residues, or an additional sequence for stability curing recombinant 
production. 

Fragments of the IGS4 polypeptides are also included in the invention. A fragment is a 
:>o\ ;>.-piic:o having an am.no nc z sequence that entirely is the same as part, but n::t all. of the amir "■ 
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acic sequence of tne aforem 




cned IGS4 polypeptides. As with IGS4 




ptides, fragments may be 



"free-stancing," or comprised within a larger polypeptide of which they form, a part or region, most 
preferably as a single continuous region. Representative examples cf polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20; 21-40, 41-60, 61-60, 81- 
100; and 101 to the end of IGS4 polypeptide. In this context "about" includes the particularly recited 
ranees larger or smaller by several, 5, 4, 3, 2 or 1 amino acid at either extreme or at both extremes. 

Preferred fragments include, for example, truncation polypeptides having the amino acid 
sequence of IGS4 polypeptides, except for deletion of a continuous series of residues that includes the 
amino terminus, or a continuous series of residues that includes the carboxyl terminus or deletion of two 
continuous series of residues, one including the amino terminus and one including the carboxyl 
terminus. An example of a truncated polypeptide according to the present invention is the polypetide of 
SEQ ID NO: 10 and SEQ ID NO: 12, which is encoded by the polynucleotide of SEQ ID NO: 9 
respectively SEQ ID NO: 11. Also preferred are fragments characterized by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet and 
beta-sheet-forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic 
regions, hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic index regions. Other preferred 
fragments are biologically active fragments. Biologically active fragments are those that mediate 
receptor activity, including those with a similar activity or an improved activity, or with a decreased 
undesirable activity. Also included are those that are antigenic or immunogenic in an animal, especially 
in a human. 

Thus, the polypeptides of the invention include polypeptides having an amino acid sequence at 
least 80% identical to that of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 and/or the 
polypeptide having the amino acid sequence encoded by the DNA insert contained in the deposit no. 
CBS1 02221 or the deposit no. CBS1 02222 at the Centraaibureau voor Schimmelcultures at Baarn the 
Netherlands, or fragments thereof with at least 80% identity to the corresponding fragment. Preferably, 
all of these polypeptide fragments retain the biological activity of the receptor, including antigenic 
activity. Variants of the defined sequence and fragments also form part of the present invention. 
Preferred variants are those that vary from the referents by conservative amino acid substitutions - i.e., 
those that substitute a residue with another of like characteristics. Typical such substitutions are among 
Ala. Val, Leu and lie; among Ser and Thr: among the acidic residues Asp and Glu; among Asn and Gin; 
and among tne basic residues Lys and Arg; or aromatic residues Phe and Tyr. Particularly preferred are 
variants in which several, 5-10. 1-5, or 1-2 amino acids are substituted, deleted, or added in any 
combination. 
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Table 3: IGS4A-64-DNA of 



NO: 9 and SEQ ID NO: 11 



GGCxCAGCI - GAAACAGAGCCTCGTA.CCAGGGGAGGCTCAGGCCTTGGA1 i . i.— —-vT^Ts..—. 
GG G AT G 3 AAAAAC TT C AG AAT G CTT C CT G G AT C T AC C AG C AG AAACT AG AAG ATC C ATT C 
CAGAAACACCTGAACAGCACCGAGGAGTATCTGGCCTTCCTCTGCGGACCTCGGCGCAGC 
CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 
AATGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 
TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 
1 0 GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 
ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 
GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 
GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 
AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 
1 5 ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 
CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 
C T AAAG AAAG AC AAAT CT C TT G AG G C AG AT G AAG GG AATG C AAAT ATTC AAAG AC C C T G C 
AGAAAATCAGTCAACAAGATGCTGTCTTTGTGGAGGAGTGGAGTGAATCCCTGGCTGCTG 
TGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTCTTCTACCTGAGCTCAGCTGTCAACC 
20 CCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATCTCTT 
CTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGGAACA 
TCTTCCTGACAGAATGCCACTTTGTGGAGCTGACCGAAGATATAGGTCCCCAATTCCCAT 
GTCAGTCATCCATGCACAACTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATGTCAA 
G AAC AAACT AT C AAAG CTTCCAC T TT AAC AAAAC CTG AA.TT CTTTC AG AG CTG ACT CT C C 
25 TCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGATATT 
AGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTAATAA 
AC G TG AAAAC T GAG AG TTAGATCTGGTTTC AAAAC C C AAG A C T G C C T G AT TTTTAGTTAT 
CTTTCCACTA.TCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAACGTGACTGG 
AAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTGAGTC 
30 ATCTACGTTCCGAGTCAGGCTGTCACTCCTACTA- 3 ' 



Table 6: IGS4A-64-protem of SEQ ID NO: 10 and SEQ ID NO: 12 (without three amino acids between 
brackets). 



35 
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• the invention can be preparec in any suita^^pn« 



"me GS4 polypeptices of the invention can be prepared in any sui:a^^p-,anner. Such polypeptides 
incijde isolated naturally occurring polypeptices, recombinants produced polypeptides, synthetically 
prccucec polypeptices, or polypeptides croauced by a combination of these methods. Methods for 
preparing such polypeptides are well known in the art. 

Polynucleotides of the Invention 

A further aspect of the invention relates to IGS4 polynucleotides. IGS4 polynucleotides include 
isolated polynucleotides which encode the IGS4 polypeptides (including IGS4A and 1GS4B) and 
10 fragments, and polynucleotides closely related thereto. More specifically, the IGS4 polynucleotide of the 
invention includes a polynucleotide comprising the nucleotide sequence contained in SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 encoding a IGS4A polypeptide of SEQ ID NO: 2 or of 
SEQ ID NO: 4 and a IGS4B polypeptide of SEQ ID NO: 6 or of SEQ ID NO: 8 respectively, 
polynucleotides having the particular sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ 
.15 ID NO: 7 and polynucleotides which essentially correspond to the DNA insert contained in the deposit 
" no. CBS102221 or the deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn 
the Netherlands. 

IGS4 polynucleotides further include a polynucleotide comprising a nucleotide sequence that 
20 has at least 80% identity over its entire length to a nucleotide sequence encoding the IGS4 polypeptide 
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8, a polynucleotide comprising a 
nucleotide sequence that is at least 80% identical to that of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 
5 or SEQ ID NO: 7 over its entire length and a polynucleotide which essentially correspond to the DNA 
insert contained in the deposit no. CBS102221 or the deposit no. CBS102222 at the Centraalbureau 
25 voor Schimmelcultures at Baarn the Netherlands. 

in this regard, polynucleotides with at least 90% identity are particularly preferred, and those with at 
least 95% are especially preferred. Furthermore, those with at least 97% are highly preferred and those 
^ with at least 98-99% are most highly preferred, with at least 99% being the most preferred. Also included 
* under IGS4 polynucleotides are a nucleotide sequence which has sufficient identity to a nucleotide 

30 sequence contained in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 or to the DNA 
insert contained in the deposit no. CBS1 02221 or in the deposit no. CBS102222 at the Centraalbureau 
voor Schimmelcultures at Baarn the Netherlands to hybridize under conditions useable for amplification 
or for use as a probe or marker. Tne invention also provides polynucleotides which are complementary 
to such IGS4 polynucleotides. 

35 

IGS4 of the invention is structurally related to other proteins of the G-O'otein coupled receptor 
*'amiiy, as shown by tre results of 5LAST searches in the public databases. The amino ac;d sequence o"" 
Taoie 1 (SEQ ID NC: 2) has about 46 % identity (using BLAST, Aitschul S.F. et ah [1997]. Nucleic Acids 
Res. 25:3229-3402; ever most c*' its length (ammo acic residues 21-338 - w.tn a human orphan 3- 
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# 043664, Tan et al., Genomics 52(2^^^-; 



protein coupiec receptor (Acc^J~ # 043664, Tan et al., Genomics 52v2j^J-229 (1998). There is 27 
°o homology (over amino acic resicues 61-349 ) to the rat neurotensin 1 receptor (Accession # P20789 
Tanaka K.et al, Neuron 4:847-85^ (1990)). The nucleotide sequence of Table 1 (SEQ ID NO: 1) is 63 % 
identical to an orphan G-protein coupled receptor over nucleotide residues 120-864 (Accession # 
AF044600, corresponding with the protein sequence 043664). Furthermore, there is 53 % identity to the 
human growth hormone secretagogue receptor over residues 94-1137 ( Howard A.D.et ai, Science 
273:974-977(1996). Thus, IGS4 polypeptides and polynucleotides of the present invention are expected 
to have, inter alia, similar biological functions/properties to their homologous polypeptides and 
polynucleotides, and their utility is obvious to anyone skilled in the art. 



Polynucleotides of the invention can be obtained from natural sources such as genomic DNA. 
In particular, degenerated PCR primers can be designed that encode conserved regions within a 
particular GPCR gene subfamily. PCR amplification reactions on genomic DNA or cDNA using the 
degenerate primers will result in the amplification of several members (both known and novel) of the 
15 gene family under consideration (the degenerated primers must be located within the same exon, when 
^ a genomic template is used). (Libert et al., Science, 1989, 244: 569-572). Polynucleotides of the 
invention can also be synthesized using well-known and commercially available techniques. 

The nucleotide sequence encoding the IGS4 polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
20 ID NO: 6 or SEQ ID NO: 8 may be identical to the polypeptide encoding sequence contained in SEQ ID 
NO: 1 (nucleotide number 55 to 1299) or SEQ ID NO: 3 (nucleotide number 64 to 1299), or SEQ ID NO: 
5 (nucleotide number 55 to 1299) or SEQ ID NO: 7 (nucleotide number 64 to 1299) respectively, or it 
may be a different nucleotide sequence, which as a result of the redundancy (degeneracy) of the 
genetic code might also show alterations compared to the polypeptide encoding sequence contained in 
25 SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, but also encodes the polypeptide of 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8, respectively. 

^ When the polynucleotides of the invention are used for the recombinant production of the IGS4 

polypeptide, the polynucleotide may include the coding sequence for the mature polypeptide or a 

30 fragment thereof, by itself; the coding sequence for the mature polypeptide or fragment in reading frame 
with other coding sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- or 
prepro- protein sequence, or other fusion peptide portions. For example, a marker sequence which 
facilitates purification o £ the fusee polypeptide can be encoded. In certain preferred embodiments of this 
aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE vector 

35 (Qiacen. Inc.) and described in Gentz et al., Proc. Natl. Acad. Sci USA (1989) 86:821 -82^, or is an HA 
tag. The polynucleotide may also contain non-coding 5' and 3' sequences, such as transcribed, non- 
Vc'-nsiatod sequences, splicing arc polyadenylatson signals, ribosome binding sites and sequences tnat 
:,:abi!:ze mRNA. 
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nts comprising the amine 



acid sequence of the 1GS4 polypeptide of SEG ID NO: 2, SEG ID NC: 4, SEQ ID NO: 6 or SEG ID NO: 
8 in which several, 5-10, 1-5, 1-3. '-2 or 1 amino acid residues are substituted, deleted or added, in any 



The present invention further relates to polynucleotides that hybridize to the herein above- 
described sequences. In this regard, the present invention especially relates to polynucleotides which 
hybridize under stringent conditions to the herein above-described polynucleotides. As herein used, the 
term "stringent conditions" means nybridization will occur only if there is at least 80%, and preferably at 
least 90%, and more preferably at least 95%, yet even more preferably 97, in particular at least 99% 
identity between the sequences. 

Polynucleotides of the invention, which are identical or sufficiently identical to a nucleotide 
sequence contained in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 or a fragment 
thereof, may be used as hybridization probes for cDNA and genomic DNA, to isolate fuli-length cDNAs 
and genomic clones encoding IGS4 and to isolate cDNA and genomic clones of other genes (including 
genes encoding homologs and orthologs from species other than human) that have a high sequence 
similarity to the IGS4 gene. People skilled in the art are well aware of such hybridization techniques. 
Typically these nucleotide sequences are 80% identical, preferably 90% identical, more preferably 95% 
identical to that of the referent. The probes generally will comprise at least 15 nucleotides. Preferably, 
such probes will have at least 30 nucleotides and may have at least 50 nucleotides. Particularly 
preferred probes will range between 30 and 50 nucleotides. 

One embodiment, to obtain a polynucleotide encoding the IGS4 polypeptide, including homologs 
and orthologs from species other than human, comprises the steps of screening an appropriate library 
under stringent hybridization conditions with a labeled probe having the SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5 or SEQ ID NO: 7 or a fragment thereof, and isolating full-length cDNA and genomic 
ciones containing said polynucleotide sequence. Such hybridization techniques are well known to 
those of skill in the art. Stringent hybridization conditions are as defined above or alternatively conditions 
under overnight incubation at 42 : C in a solution comprising: 50% formamide, 5xSSC (150mM NaCI, 
15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10 % dextran 
sulfate, and 20 microgram mi denatured, sneared salmon sperm DNA, followed by washing the filters in 
O.lxSSC at about 65"C. 

The polynucleotides and pc /peptides of the present invention may be used as research reagents 
and materials for discovery c: treatments and diagnostics to anima! and human disease. 



combination. 



Vectors, Host Cells, E;^Bsior, 
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The present invention also relates to vectors which comprise a polynucleotide or polynucleotides 
of the present invention, and host cells which are genetically engineered with vectors of the invention 
5 and to the production of polypeptides of the invention by recombinant techniques. Cell-free translation 
systems can also be used to produce such proteins using RNAs derived from the DNA constructs of the 
present invention. 

For recombinant production, host cells can be genetically engineered to incorporate expression 
10 systems or portions thereof for polynucleotides of the present invention. Introduction of polynucleotides 
into host ceils can be effected by methods described in many standard laboratory manuals, such as 
Davis et al., BASIC METHODS IN MOLECULAR BIOLOGY (1986) and Sambrook et al., MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. (1989) such as calcium phosphate transfection, DEAE-dextran mediated transfection, 
15 transvection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape 
" loading, ballistic introduction or infection. 

Representative examples of appropriate hosts include bacterial cells, such as streptococci, 
staphylococci, E. coli, Streptomyces and Bacillus subtilis cells; fungal cells, such as yeast cells and 
20 Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, HeLa. C127, 3T3, BHK, HEK 293 and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used. Such systems include, among others, 
chromosomal, episomai and virus-derived systems, e.g., vectors derived from bacterial plasmids, from 

25 bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast 
chromosomal elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived 

^ from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, 
such as cosmids and phagemids. The expression systems may contain control regions that regulate as 

30 well as engender expression. Generally, any system or vector suitable to maintain, propagate or 
express polynucleotides to produce a polypeptide in a host may be used. The appropriate nucleotide 
sequence may be inserted into an expression system by any of a variety of well-known and routine 
techniques, such as, for example, those set forth in Sambrook et al, MOLECULAR CLONING, A 
LABORATORY MANUAL (supra). 

35 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
pe-ioiasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the desired polypeptide. These signals may be endogenous to the polypeptide or they 
may be heterologous signals. 
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if the IGS4 polypeptide is to be expressec for use in screening assays, generally, it is preferred 
that the polypeptice oe produced at the surface of the cell. In this event, the cells may be harvested prior 
to use in the screening assay, in case the affinity cr functional activity of the IGS4 polypeptide is 
modified by receptor activity modifying proteins (RAMP), coexpression of the relevant RAMP most likely 
at the surface of the cell is preferred and often required. Also in this event harvesting of cells expressing 
the IGS4 polypeptide and the relevant RAMP prior to use in screening assays is required. If the IGS4 
polypeptide is secreted into the medium, the medium can be recovered in order to recover and purify the 
polypeptide; if produced intracellular^, the cells must first be iysed before the polypeptide is recovered. 



IGS4 polypeptides can be recovered and purified from recombinant cell cultures by well-known 
methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most preferably, high 
15 performance liquid chromatography is employed for purification. Well-known techniques for refolding 
" proteins may be employed to regenerate active conformation when the polypeptide is denatured during 
isolation and or purification. 



Diagnostic Assays 



This invention also relates to the use of IGS4 polynucleotides for use as diagnostic reagents. Detection 
of a mutated form of the IGS4 gene associated with a dysfunction will provide a diagnostic too! that can 
add to or define a diagnosis of a disease or susceptibility to a disease which results from under- 
expression, over-expression or altered expression of IGS4. Also in this event co-expression of relevant 
25 receptor activity modifying proteins can be required to obtain diagnostic assays of desired quality. 
Individuals carrying mutations in the IGS4 gene may be detected at the DNA level by a variety of 
techniques. 

^ Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, 

saliva, tissue biopsy or autopsy material. The genomic DNA may be used directly for detection or may 

30 be amplified enzymatically by using PCR or other amplification techniques prior to analysis. RNA or 
cDNA may also be used in similar fashion. Deletions and insertions can be detected by a change in size 
of the amplified product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled IGS4 nucleotide sequences. Perfectly matched sequences can be 
distinguished from mismatched duplexes by RNase digestion or by differences in melting temperatures. 

25 DNA sequence differences may also be detected by alterations in electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See. e.g., Myers et 
a!.. Science (1S85) 230:1242. Sequence changes at specific locations may aiso be revealed D .' 
nuclease protection assays, sucn as RNase and SI protection cr the chemical ceavage method. See 
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oligonucleotide probes comj^^ the IGS4 nucleotide sequence or^^ments thereof can be 
constructed to conduct efficient screening of e.g., genetic mutations. Array technology methods are well 
known and have genera! applicability and can be used to address a variety of questions in molecular 
genetics including gene expression, genetic linkage, and genetic variability. (See for example: K4.Chee 
5 et al., Science, Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to CNS 
disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders such as obsessive 
compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia and panic, major depressive 
10 disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, autism, delirium, Alzheimer 
disease/dementia, severe mental retardation and dyskinesias, such as Huntington's disease or Gilles 
dela Tourett's syndrome, anorexia, bulimia, stroke, addiction/dependency/craving, sleep disorder, 
epilepsy, migraine; attention deficit/hyperactivity disorder (ADHD); cardiovascular disease including 
heart failure, angina pectoris, arrhythmias, myocardial infarction, hypotension; hypertension, thrombosis; 
^5 dyslipidemias; obesity; emesis; IBS; IBD; GERD; conditions of delayed gastric emptying, such as 
™ postoperative or diabetic gastroparesis; osteoporosis; infections such as bacterial, fungal, protozoan 
and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; urinary retention; 
ulcers; asthma; allergies and benign prostatic hypertrophy, through detection of mutation in the IGS4 
gene by the methods described. 

20 

In addition, CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders 
such as obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia and 
panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, 
autism, delirium, Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as 

25 Huntington's disease or Gilles dela Tourett's syndrome, anorexia, bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
^ infarction, hypotension; hypertension, thrombosis; dyslipidemias; obesity; emesis; IBS; IBD; GERD; 
conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 

30 infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV- 
1 or HIV-2; pain; cancers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy, 
can be diagnosed by methods comprising determining from a sample derived from a subject an 
abnormally decreased or increased level of the IGS4 polypeptide or iGS4 mRNA. Decreased or 
increased expression can be measured at the RNA level using any of the methods well known in the art 

35 for the quantitation of polynucleotides, such as, for example, PCR, RT-PCR. RNase protection, 
Northern blotting and other hybridization methods. Assay techniques that can be used to determine 
levels of a protein, such as an !GS4. in a sample derived from a host are well known to those of skill in 
the ant. Such assay methods i.ncude radioimmunoassays, competitive-binding assays, Western Blot 
nr:cVvs:s and ELISA assays. 
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In another aspect, the present invention relates to a ciagonostic kit for a disease or suspectability 
to a cisease, particularly CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) 
disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia 
5 and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, 
autism, delirium. Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as 
Huntington's disease or Giiles dela Tourett's syndrome, anorexia, bulimia, stroke, 
addiction/dependency/craving, steep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
10 infarction, hypotension; hypertension, thrombosis; dysiipidemias; obesity; emesis; IBS; IBD; GERD; 
conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 
infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV- 
1 or HIV-2; pain; cancers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy, 
which comprises: 

^5 (a) an 1GS4 polynucleotide, preferably the nucleotide sequence of SEQ ID NO: 1. SEQ ID NO: 

' 3, SEQ ID NO: 5 or SEQ ID NO: 7, or a fragment thereof; 

(b) a nucleotide sequence complementary to that of (a); 

(c) an IGS4 polypeptide, preferably the polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6 or of SEQ ID NO: 8, or a fragment thereof; or 

20 (d) an antibody to an IGS4 polypeptide, preferably to the polypeptide of SEQ ID NO: 2, SEQ ID 

NO: 4, SEQ ID NO: 6 or of SEQ ID NO: 8. 
(e) a RAMP polypeptide required for the relevant biological or antigenic properties of an IGS4 
polypeptide 

25 It will be appreciated that in any such kit, (a), (b), (c) (d) or (e) may comprise a substantial component. 
Chromosome Assays 

^ The nucleotide sequences of the present invention are also valuable for chromosome 

30 identification. The sequence is specifically targeted to and can hybridize with a particular location on an 
individual human chromosome. The mapping of relevant sequences to chromosomes according to the 
present invention is an important first step in correlating those sequences with gene associated 
disease. Once a sequence has been mapped to a precise chromosomal location, the physical position 
of the sequence on the chromosome can be correlated with genetic map data. Such data are found, for 
25 example, in V. McKusick. Mence.ian Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical L : bra~y;. The relationship between genes and diseases that have been 
ma oped to tne same c-rcmcsc~a -egion are then identified through linkage anays:s : coinheritar.ee o* 
;;nys:ca!iy adjacent ceres). 
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The cifferences in the or genomic sequence between aftectl^^c unaffected individuals 

can also be determined. If a mutation is observed in some or aii of the affected individuals but not in any 
normal individuals, then the mutation is likely to be the causative agent of the disease. 

5 Antibodies 

The polypeptides of the invention or their fragments or analogs thereof, or cells expressing them if 
required together with relevant RAMP's, may also be used as immunogens to produce antibodies 
immunospecific for the iGS4 polypeptides. The term "immunospecific" means that the antibodies have 
10 substantial! greater affinity for the polypeptides of the invention than their affinity for other related 
polypeptides in the prior art. 

Antibodies generated against the IGS4 polypeptides may be obtained by administering the 
polypeptides or epitope-bearing fragments, analogs or cells to an animal, preferably a nonhuman, using 
15 routine protocols. For preparation of monoclonal antibodies, any technique, which provides antibodies 
^ produced by continuous cell line cultures, may be used. Examples include the hybridoma technique 
(Kohler, G. and Milstein, C, Naure (1975) 256:495-497), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al. r Immunology Today (1983) 4:72) and the EBV-hybridoma technique 
(Cole et al., MONOCLONAL ANTIBODIES AND CANCER THERAPY, pp. 77-96, Alan R. Liss, Inc., 
20 1985). 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
polypeptide or to purify the polypeptides by affinity chromatography. 

25 Antibodies against IGS4 polypeptides as such or against IGS4polypeptide-RAMP complexes, may also 
be employed to treat CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) 
disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia 
^ and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, 
™ autism, delirium, Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as 

30 Huntington's disease or Gilles dela Touretfs syndrome. anorexia, bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
infarction, hypotension; hypertension, thrombosis; dyslipidemias; obesity; emesis; IBS; lBD; GERD; 
conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 

35 infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV- 
1 or HIV-2; pain; maycers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy, 
a mono others 
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Animals 

Ar,other aspect of the invention reiates to a animai-based systems which act as models for 
disorders arising from aberrant expression or activity of IGS4. Animal based model systems may also be 
5 used to further characterize the activity of the 1GS4 gene. Such assays may be utilized as part of 
screening strategies designed to identify compounds which are capable to treat IGS4 based disorders 
such as CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders such as 
obsessive compulsive disorder (GCD), post traumatic stress disorder (PTSD), phobia and panic, major 
depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, autism, delirium, 
10 Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as Huntington's disease 
or Gilles dela Tourett's syndrome, anorexia, bulimia, stroke, addiction/dependency/craving, sleep 
disorder, epilepsy, migraine; attention deficit/hyperactivity disorder (ADHD); cardiovascular disease 
including heart failure, angina pectoris, arrhythmias, myocardial infarction, hypotension; hypertension, 
thrombosis; dyslipidemias; obesity; emesis; IBS; IBD; GERD; conditions of delayed gastric emptying, 
^5 such as postoperative or diabetic' gastroparesis; osteoporosis; infections such as bacterial, fungal, 
™ protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; urinary 
retention; ulcers; asthma; allergies and benign prostatic hypertrophy. In this way the animal-based 
models may be used to identify pharmaceutical compounds, therapies and interventions which may be 
effective in treating disorders aberrant expression or activity of IGS4. In addition such animal models 
20 may be used to determine the LD 50 and the EDso in animal subjects. These data may be used to 
determine the in vivo efficacy of potential IGS4 disorder treatments. 

Animal-based mode! systems of IGS4 based disorders, based on aberrant IGS4 expression or activity, 
may include both non-recombinant animals as well as recombinantiy engineered transgenic animals. 

25 Animal models for IGS4 disorders may include, for example, genetic models. Animai models 

exhibiting IGS4 based disorder-like symptoms may be engineered by utilizing, for example, IGS4 
sequences such as those described, above, in conjunction with techniques for producing transgenic 
^ animals that are well known to persons skilled in the art. For example, IGS4 sequences may be 
introduced into, and overexpressed and/or misexpressed in, the genome of the animai of interest, or, if 

30 endogenous IGS4 sequences are present, they may either be overexpressed, misexpressed, or, 
alternatively, may be disrupted in order to underexpress or inactivate IGS4 gene expression. 

In order to ove r express or misexpress a IGS4 gene sequence, the coding portion of the 1GS4 
gene sequence may be licsted to a regulatory sequence which is capable of driving hign level gene 
25 expression or expression ir a eel: type in which the gene is not normally expressed in the animal type of 
interest. Such regulatory regions will be well known to those skilled in the art. and may be utilized in the 
absence of undue exoerimentatic-. 
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c^J^en oogenous IGS4 gene sequence, such^^^c 



For underexpression d^^endogenous IGS4 gene sequence, sucH^Wequence may be isolated 
and engineered such that when reintroduced into the genome of the animal of interest, the endogenous 
IGS4 gene alleles will be inactivated, or "knocked-out". Preferably, the engineered IGS4 gene sequence 
is introduced via gene targeting such that the endogenous IGS4 sequence is disrupted upon integration 
5 of the engineered IGS4 gene sequence into the animal's genome. 
Gene targeting is discussed, below, in this section. 

Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro- 
pigs, goats, and non-human primates, e.g., baboons, squirrels, monkeys, and chimpanzees may be 
10 used to generate animal models of IGS4 related disorders. 

Any technique known in the art may be used to introduce a IGS4 transgene into animals to 
produce the founder lines of transgenic animals. Such techniques include, but are not limited to 
pronuclear microinjection (Hoppe, P.C. and Wagner, T.E., 1989, U.S. Pat. No. 4,873,191); retrovirus 
^5 mediated gene transfer into germ Sines (van der Putten et al., Proc. Natl. Acad. Sci., USA 82:6148- 
™ 6152, 1985); gene targeting in embryonic stem cells (Thompson et al., Cell 56:313-321, 1989,); 
electroporation of embryos (Lo, Mol. Cell. Biol. 3:1 803-1 B14, 1983); and sperm-mediated gene transfer 
(Lavitrano et al., Cell 57:717-723, 1989); etc. For a review of such techniques, see Gordon, Transgenic 
Animals, Intl. Rev. Cytol. 1 1 5: 1 71 -229, 1989, which is incorporated by reference herein in its entirety. 

20 

The present invention provides for transgenic animals that carry the IGS4 transgene in all their 
cells, as well as animals which carry the transgene in some, but not all their cells, Le^, mosaic animals. 
(See, for example, techniques described by Jakobovits, Curr. Biol. 4:761-763, 1994) The transgene may 
be integrated as a single transgene or in concatamers, e.g., head-to-head tandems or head-to-tail 
25 tandems. The transgene may also be selectively introduced into and activated in a particular cell type by 
following, for example, the teaching of Lasko et al. (Lasko, M. et a!., Proc. Natl. Acad. Sci. USA 89:6232- 
6236, 1992). 

The regulatory sequences required for such a cell-type specific activation will depend upon the 
30 particular cell type of interest, and will be apparent to those of skill in the art. 

When it is desired that the IGS4 transgene be integrated into the chromosomal site of the 
endogenous IGS4 gene, gene targeting is preferred. Briefly, when such a technique is to be utilized, 
vectors containing some nucleotide sequences homologous to the endogenous IGS4 gene of interest 
35 (e.g., nucleotide sequences of the mouse !GS4 gene) are designed for the purpose of integrating, via 
homologous recombination with chromosomal sequences, into and disrupting the function of, the 
nucleotide sequence of the endogenous IGS4 gene The transgene may also be selectively introduced 
into a particular cell type, thus inactivating the endogenous gene of interest n only that cell type, by 
''o:'-:wing, for example, the teach. ng of Gu et al. fGu, H. et a!.. Science 265:103-106. 1994). The 
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regulatory sequences recuireG for such a cell-type specific inactivation l^^epenc upon the psr.icu.ar 
ceil type of interest, and will be apparent to those of skill in the art. 

Once transgenic animals nave been generated, the expression of the recombinant IGS4 gene 
5 and protein may be assayed utilizing standard techniques. Initial screening may be accomplished by 
Southern blot analysis or PCR technicues to analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of MRNA expression of the IGS4 transgene in the tissues of the 
transgenic animals may also be assessed using techniques which include but are not limited to Northern 
blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, and RT-PCR. 

10 Samples of target gene-expressing tissue, may also be evaluated immunocytochemically using 
antibodies specific specific for the target gene transagene product of interest. The IGS4 transgenic 
animals that express IGS4 gene mRNA or IGS4 transgene peptide (detected immunocytochemically, 
using antibodies directed against target gene product epitopes) at easily detectable levels may then be 
further evaluated to identify those animals which display characteristic IGS4 based disorder symptoms. 

1 5 Once IGS4 transgenic founder animals are produced (i.e. . those animals which express IGS4 proteins in 
^ cells or tissues of interest, and which, preferably, exhibit symptoms of IGS4 based disorders), they may 
be bred, inbred, outbred, or crossbred to produce colonies of the particular animal. Examples of such 
breeding strategies include but are not limited to: outbreeding of founder animals with more than one 
integration site in order to establish separate lines; inbreeding of separate lines in order to produce 

20 compound IGS4 transgenics that express the IGS4 transgene of interest at higher levels because of the 
effects of additive expression of each IGS4 transgene; crossing of heterozygous transgenic animals to 
produce animals homozygous for a given integration site in order to both augment expression and 
eliminate the possible need for screening of animals by DNA analysis; crossing of separate homozygous 
lines to produce compound heterozygous or homozygous lines; breeding animals to different inbred 

25 genetic backgrounds so as to examine effects of modifying alleles on expression of the IGS4 transgene 
and the development of !GS4-!ike symptoms, one such approach is to cross the IGS4 transgenic 
founder animals with a wild type strain to produce an F1 generation that exhibits IGS4 related disorder- 
^ like symptoms, such as those described above. The F1 generation may then be inbred in order to 
^ develop a homozygous line, if it is found that homozygous target gene transgenic animals are viable. 



30 



Vaccines 

Another aspect of the invention relates to a method for inducing an immunological response in a 
r-a-mal which comprises inoculating the mammal with the IGS4 polypeptide, or a fragment thereof, if 
-ecu. red together with a RAMP polypeptide, adequate to produce antibody and/or T cell immune 
response to protect said animal f r om CNS disorders, including schizophrenia, episodic paroxysmal 
a-xiety fEPAi c, soccers such as obsessive compulsive disorder (OCD). ocst traumatic stress ciscae 
■~TS3). pnaaia a^c panic. major depressive disorder, bipolar disorder. PsrKinsorVs cisease, genera 
a-. / ety d:samea aaasm. co r...m, Alzheimer cisease aementia. severe menta retardation ana 
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dyskinesias, such as 



Hunting 




isease or Gilles dela Tourett's syndroi 




norexia, bulimia, stroke, 



addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
infarction, hypotension; hypertension, thrombosis; dyslipidemias; Obesity; emesis; IBS; IBD; GERD; 
5 conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 
infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV- 
1 or HlV-2; pain; cancers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy, 
among others. Yet another aspect of the invention relates to a method of inducing immunological 
response in a mammal which comprises delivering the IGS4 polypeptide via a vector directing 
10 expression of the IGS4 polynucleotide in vivo in order to induce such an immunological response to 
produce antibody to protect said animal from diseases 

A further aspect of the invention relates to an immunological/vaccine formulation (composition) 
which, when introduced into a mammalian host, induces an immunological response in that mammal to 

J5 an IGS4 polypeptide wherein the composition comprises an IGS4 polypeptide or IGS4 gene. The 

™ vaccine formulation may further comprise a suitable carrier. Since the IGS4 polypeptide may be broken 
down in the stomach, it is preferably administered parenterally (including subcutaneous, intramuscular, 
intravenous, intradermal etc. injection). Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, 

20 bacteriostats and solutes which render the formulation isotonic with the blood of the recipient; and 
aqueous and non-aqueous sterile suspensions which may include suspending agents or thickening 
agents. The formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the addition of the 
sterile liquid carrier immediately prior to use. The vaccine formulation may also include adjuvant 

25 systems for enhancing the immunogenicity of the formulation, such as oil-in water systems and other 
systems known in the art. The dosage will depend on the specific activity of the vaccine and can be 
readily determined by routine experimentation. 



The IGS4 polypeptide of the present invention may be employed in a screening process for 
compounds which bind the receptor and which activate (agonists) or inhibit activation of (antagonists) 
the receptor polypeptide of the present invention. Thus, polypeptides of the invention may also be used 
to assess the binding of small molecule substrates and ligands in, for example, cells, cell-free 
35 preparations, chemical libraries; and natural product mixtures. These substrates and ligands may be 
natural substrates and ligands or may be structural or functional mimetics. 

IGS4 polypeptides are responsible for biological functions, including pathologies. Accordingly, .: is 
desirable to find compounds and crags which stimulate IGS4 on the one hand and which caa inhibit 'a e 




Screening Assays 
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function of IGS4 on the other hand, in general, agonists are employed WPferaoeutic and prophylactic 
purposes for such concitions as CNS disoroers. including schizophrenia, episodic paroxysmal anxiety 
(EPA; disorcers such as obsessive compulsive disorcer (OCD), post traumatic stress disorder (PTSD), 
phobia and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety 
5 disorder, autism, delirium. Alzheimer disease/dementia, severe mental retardation and dyskinesias, 
such as Huntington's disease or Giiles dela Tourett's syndrome, anorexia, bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular disease including heart failure, angina pectoris, arrhythmias, myocardial 
infarction, hypotension; hypertension, thrombosis; dyslipidemias; obesity; emesis; IBS; IBD; GERD; 
10 conditions of delayed gastric emptying, such as postoperative or diabetic gastroparesis; osteoporosis; 
infections such as bacterial, fungal, protozoan and viral infections, particularly infections caused by H1V- 
1 or HIV-2; pain; cancers; urinary retention; ulcers; asthma; allergies and benign prostatic hypertrophy. 
Antagonists may be employed for a variety of therapeutic and prophylactic purposes for such conditions 
as CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders such as 
^15 obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia and panic, major 
™ depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, autism, delirium, 
Alzheimer disease/dementia, severe mental retardation and dyskinesias, such as Huntington's disease 
or Giiles dela Tourett's syndrome, anorexia, bulimia, stroke, addiction/dependency/craving, sleep 
disorder, epilepsy, migraine; attention deficit/hyperactivity disorder (ADHD); cardiovascular disease 
20 including heart failure, angina pectoris, arrhythmias, myocardial infarction, hypotension; hypertension, 
thrombosis; dyslipidemias; obesity; emesis; IBS; IBD; GERD; conditions of delayed gastric emptying, 
such as postoperative or diabetic gastroparesis; osteoporosis; infections such as bacterial, fungal, 
protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; urinary 
retention; ulcers; asthma; allergies and benign prostatic hypertrophy. 

25 

In general, such screening procedures involve producing appropriate cells, which express the 
receptor polypeptide of the present invention on the surface thereof and, if essential co-expression of 
^ RAMP's at the surface thereof. Such cells include cells from mammals, yeast, Drosophila or E. coli. 

Cells expressing the receptor (or cell membrane containing the expressed receptor) are then contacted 
30 with a test compound to observe binding, or stimulation or inhibition of a functional response. 

One screening lechmcue includes the use of cells which express the receptor of this invention (for 
example, transferee CHO cells) in a system which measures extracellular pH or intracellular calcium 
changes caused by receptor activation. In this technique, compounds may be contacted with cells 
3n expressing the receptor polypeptide of the present invention. A second messenger response, e.g., 
signal transduction, pH chances, or chances in calcium level, is then measured to determine whether 
the potential compound activates or" mhioits the receptor. 
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/e^^^eening for receptor inhibitors by determinWgir 



Another method involve^^Jeening for receptor inhibitors by determining inhibition or simulation of 
receptor-mediated cAMP and/or adenylate cyclase accumulation. Such a method involves transfecting 
an eukaryotic cell with the receptor of this invention to express the receptor on the cell surface. The cell 
is then exposed to potential antagonists in the presence of the receptor of this invention. The amount of 
5 cAMP accumulation is then measured. If the potential antagonist binds the receptor, and thus inhibits 
receptor binding, the levels of receptor-mediated cAMP, or adenylate cyclase, activity will be reduced or 
increased. 

Another method for detecting agonists or antagonists for the receptor of the present invention is 
1 0 the yeast-based technology as described in U.S. Patent 5,482,835, incorporated by reference herein. 

The assays may simply test binding of a candidate compound wherein adherence to the celts 
bearing the receptor is detected by means of a label directly or indirectly associated with the candidate 
compound or in an assay involving competition with a labeled competitor. Further, these assays may 
^5 test whether the candidate compound results in a signal generated by activation of the receptor, using 
detection systems appropriate to the cells bearing the receptor at their surfaces. Inhibitors of activation 
are generally assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. 

20 Further, the assays may simply comprise the steps of mixing a candidate compound with a 

solution containing an IGS4 polypeptide to form a mixture, measuring the IGS4 activity in the mixture, 
and comparing the IGS4 activity of the mixture to a standard. 

The 1GS4 cDNA, protein and antibodies to the protein may aiso be used to configure assays for 
25 detecting the effect of added compounds on the production of IGS4 mRNA and protein in cells. For 
example, an ELISA may be constructed for measuring secreted or cell associated levels of IGS4 protein 
using monoclonal and polyclonal antibodies by standard methods known in the art, and this can be 
^ used to discover agents which may inhibit or enhance the production of IGS4 (also called antagonist or 
agonist, respectively) from suitably manipulated cells or tissues. Standard methods for conducting 
30 screening assays are well known in the art. 

Examples of potential IGS4 antagonists include antibodies or, in some cases, oligonucleotides or 
proteins which are closely related to the Itgand of the IGS4, e.g., a fragment of the ligand, or small 
molecules which bind to the receptor but do not elicit a response, so that the activity of the receptor is 

35 prevented. 



Thus in another aspect, the present invention relates to a screening kit for- identifying agonists, 
antagonists, ligands, receptors, substrates, enzymes, etc. for iGS4 polypeptides; or compounds which 
decrease or enhance the production of IGS4 polypeptides, which comprises: 
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(a) an IGS4 polypeptide, preferably that cf SEQ ID NO: 2, SEG ID^TO: 4, SEQ ID NO: 6 or SEC- 
ID NO: 8; 

(b) a recombinant cell expressing an IGS4 polypeptice, preferably that of of SEQ ID NO: 2, SEQ 
ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8; 

(c) a cell membrane expressing an IGS4 polypeptide: preferably that of of SEQ iD NO: 2, SEQ 
ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8; or 

(c) antibody to an IGS4 polypeptide, preferably that of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6 or SEQ ID NO: 8. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial component. 



Prophylactic and Therapeutic Methods 



This invention provides methods of treating abnormal conditions related to both an excess of and 
insufficient amounts of IGS4 activity. 

If the activity of !GS4 is in excess, several approaches are available. One approach comprises 
administering to a subject an inhibitor compound (antagonist) as hereinabove described along with a 
pharmaceutical^ acceptable carrier in an amount effective to inhibit activation by blocking binding of 
iigands to the IGS4, or by inhibiting interaction with a RAMP polypeptide or a second signal, and thereby 
alleviating the abnormal condition. 

In another approach, soluble forms of IGS4 polypeptides still capable of binding the ligand in 
competition with endogenous IGS4 may be administered. Typical embodiments of such competitors 
comprise fragments of the IGS4 polypeptide. 

In still another approach, expression of the gene encoding endogenous IGS4 can be inhibited 
using expression-blocking techniques. Known such techniques involve the use of antisense sequences, 
either internally generated or separately administered. See, for example, O'Connor, J Neurochem 
(1991) 56:560 in Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, Florida USA (1988). Alternatively, oligonucleotides, which form triple helices with the gene, can 
be supplied. See, for example, Lee et al., Nucleic Acids Res (1979) 6:3073; Cooney et a!.. Science 
(1988) 241 :456; Dervan et al, Science (1991) 251:1360. These oligomers can be administered per se or 
the relevant oligomers can be expressed in vivo. 

For treating abnormal conditions related to an under-expression of IGS4 and its activity, several 
approaches a-e also available. One approach comprises administering to a subject a therapeutical'"/ 
effective amount of a compound which activates IGS4. i.e.. an agonist as tiesc-bed aoove. in 
combination w\ir\ a oharmaceut ca ; y acceptable carne r , to thereby alleviate the abnormal conc.ticr.. 
Alternatively, eerie merapy may ce employee to c'fec! the endogenous production cf IGS- ::■_> tne 
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relevant cells in the subject! 




example, a polynucleotide of the inve 




may be engineered for 



expression in a replication defective retroviral vector, as discussed above. The retroviral expression 
construct may then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the packaging cell now 
produces infectious viral particles containing the gene of interest. These producer cells may be 
administered to a subject for engineering cells in vivo and expression of the polypeptide in vivo. For 
overview of gene therapy, see Chapter 20, Gene Therapy and other Molecular Genetic-based 
Therapeutic Approaches, (and references cited therein) in Human Molecular Genetics, Strachan T. and 
Read A.P., BIOS Scientific Publishers Ltd (1996). 

Formulation and Administration 

Peptides, such as the soluble form of IGS4 polypeptides, and agonists and antagonist peptides or 
small molecules, may be formulated in combination with a suitable pharmaceutical carrier. Such 
formulations comprise a therapeutically effective amount of the polypeptide or compound, and a 
pharmaceutical ly acceptable carrier or excipient. Formulation should suit the mode of administration, 
and is well within the skill of the art. The invention further relates to pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides and other compounds of the present invention may be employed alone or in 
conjunction with other compounds, such as therapeutic compounds. 

p re f erre{ j forms of systemic administration of the pharmaceutical compositions include injection, 
typically by intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include transmucosal and 
transdermal administration using penetrants such as bile salts or fusidic acids or other detergents. In 
addition, if properly formulated in enteric or encapsulated formulations, oral administration may also be 
possible. 

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the attending 
practitioner. Suitable dosages, however, are in the range of 0.1-100 ug/kg of subject. Wide variations in 
the needed dosage, however, are to be expected in view of the variety of compounds available and the 
differing efficiencies of various routes of administration. For example, oral administration would be 
expected to require higher dosages than administration by intravenous injection. Variations in these 
dosage levels can be adjusted using standard empirical routines for optimization, as is well understood 
in the art. 
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Polypeptides usee in treatment can aiso be generated endogenou^^n the subject, in treatment 
mocaiities often referred to as "gene therapy" as described above. Thus, for example, cells from a 
subject may be engineered with a polynucleotide, sucn as a DNA or RNA, to encode a polypeptide ex 
vivo, and for example, by the use of a retroviral plasmid vector. The cells are then introduced into the 
subject. 



EXAMPLE 1. THE CLONING OF CDNA ENCODING A NOVEL G PROTEIN-COUPLED RECEPTOR. 

10 Example 1a. Homology PCR cloning of a genomic fragment encoding a novel G-protein coupled 
receptor (GPCR). 

A PCR based homology cloning strategy was used to isolate partial genomic DNA sequences 
encoding novel G-protein coupled receptors. Forward (F22) and reverse (R44 and R46) degenerate 
15 PCR primers were designed in conserved areas of the neurotensin receptor gene family (Vita N. et 
) al.[1993] Febs Lett. 317, 139-142; Vita N. et at. [1998] Eur. J. Pharmacol. 360, 265-272) within 
transmembrane domains 1 (TM1) and 3 (TM3) and at the boundary between TM3 and intracellular loop 
2 (12): 

20 

F22 (TM1): 

5'-CTCATCTTCGCGGTGGGC(A or G)C(A,C,G or T)G(C or T)(A,C.G or T)GG-3' 

(SEQ ID NO: 13) 

25 R44 (TM3/I2): 

5'-GGCCAGGCAGCGCTCCGCGCT(C or lnosine)A{A or G)(A,C T G or T)C(C or T)(A,C,G or T)GC(A,G 
orT)-3" 

(SEQ ID NO: 14) 

^0 R46 (TM3): 

5'_GAA(A or G)TA(A or G)TAGCC(A or G)CG(A or G)CAGCC(A or T)-3" 

(SEQ ID NO: 15) 

In order to suppress amplification of known members of the neurotensin receptor family, the 3' 
35 ultimate nucleotide position of primer R44 was chosen in such a way that is was not complementary to 
:ne corresponding position of ooth NTR1 and NTR2 cDNA. The primary PCR reaction was earned out in 
a 50ul volume and contained 100nc human genomic DNA (Clontech), 5 ul GeneAmp v 10 x PCR buffer 
I! 'CQmM Tns-HCI pH 8.3; 500 rrM KCI. Perkm Elmer). 3.6 ul 25 rr.M McC! : 0.36 u! dNTPs (25mM of 
each dNTP). 1.5 units AmpiiTac-Goid"' polymerase (Perkin Elmeri and 30 pmoles of each of tne 
4C generated forwarc ■ F22; and reverse primer (R44 ; . Reaction tubes were heatec at Z'i C for :C rr-.r: 
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and then subjected to 35 cy^^f denaturation {95°C, 1 min), annealing^FC, 2 min) and extension 
(72 C C, 3min). Finally reaction tubes were heated for 10 mm at 72°C. 

For the semi-nested PCR reaction 1 ul of a 1/50 dilution of the primary PCR reaction was used 
as a template using the degenerate forward and reverse primers F22 and R46 respectively. The semi- 
5 nested PCR reaction was carried out under the same conditions as the primary PCR reaction. 

Semi-nested PCR reaction products were size fractionated on an agarose gel and stained with 
ethidium bromide. Although a fragment of ± 220 bp was expected, only a fragment of ± 120 bp was 
visible. This fragment was purified from gel using the Qiaex-II™ purification kit (Qiagen) and ligated into 
the pGEM-T plasmid according to the procedure recommended by the supplier (pGEM-T kit, Promega). 

10 The recombinant plasmids thus produced were used to transform competent E. coli SURE™ 2 bacteria 
(Stratagene). Transformed cells were plated on LB agar plates containing ampicillin (100 ug/ml), IPTG 
(0.5 mM) and X-gal (50 pg/ml). Plasmid DNA was purified from mini-cultures of individual colonies using 
a Qiagen-tip 20 miniprep kit (Qiagen). DNA sequencing reactions were carried out on the purified 
plasmid DNA with the ABI Prism™ BigDye™ Terminator Cycle Sequencing Ready Reaction kit (PE-ABI), 

15 using insert-flanking primers. 

^ Table 7: overview of oligo primers used. 



SEQ ID NO: 13 


F22: 5'-CTCATCTTCGCGGTGGGC(A or G)C(A,C,G or T)G(C or T)(A,C.G or 
T)GG-3' 


SEQ ID NO: 14 


R44: 5'-GGCCAGGCAGCGCTCCGCGCT(C or !nosine)A(A or G)(A,C,G or T)C(C 
or T)(A,C,G or T)GC(A,G or T)-3* 


SEQ ID NO: 15 


R46: 5'-GAA(A or G)TA(A or G)TAGCC(A or G)CG(A or G)CAGCC(A or T)-3' 


SEQ ID NO: 16 AP1: 5'-CCATCCTAATACGACTCACTATAGGGC-3' 


SEQ ID NO: 17 


AP2: 5'-ACTCACTATAGGGCTCGAGCGGC-3' 


SEQ ID NO: 18 


IGS4R1 : 5'GGATCCCAAATAAGAAAGGGTAGTTGC-3' 


SEQ ID NO: 19 


1GS4R2: 5'AAAGGGTAGTTGCGCCACATCTCATAGAC-3' 


SEQ ID NO: 20 


IGS4F5: 5'AGGTCTATGAGATGTGGCGCAACTACCCT-3' 


SEQ ID NO: 21 


IGS4F6: 5'ATGTGGCGCAACTACCCTTTCTTATTTGGG-3' 


SEQ ID NO: 22 


R74: 5'-CGGAAGTTGGCGGACACG(A or G)(A.C or G)(A or G)TT(A or G)TA-3' 


SEQ ID NO: 23 


IGS4F7: 5'-GCTCAGCTTGAAACAGAGCCTCGTACC-3' 


SEQ ID NO: 24 


IGS4F8: 5'-CCATGTGGATCTACAATTTCATCATCC-3' 


■ SEQ ID NO: 25 


1GS4F9: 5'-AAGACAAATCTCTTGAGGCAGATGAAGGG-3' 



SEQ ID NO: 26 | IGS4F10: 5'-GATGCTGTTTGTCTTGGTCTTAGTGTTTGC-3' 



SEQ ID NO: 27 j IGS4R5: 5'-GGATGATGAAATTGTAGATCCACATGGGC-3' 



SEQ ID NO: 28 



IGS4R6: 5'-TGTGGAGAAGTCTCTCAAAGTGTGG-3* 



SEQ ID NO: 29 ; IGS4R7: 5'-TAGTAGGAGTGACAGCCTGACTCGGAACG-3' 
SEQ ID NO: 30 ; IGS4R8: 5'-AACGTAGATGACTCAGGACGAACCATTTC03' 



SEQ ID NO: 31 



IGS4F1 1 : 5"-TCGTACCAGGGGAGGCTCAGGC 
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Sequencing reaction products were purified via EtOH/NaOAc precipitation and analyzed on an ABI 377 
automated sequencer. 

Sequence analysis of the insert of cione HNT1552 showed that it potentially encoded part of a novel 
5 member of the GPCR family. We refer to this novel GPCR sequence as IGS4. 

Example 1b. Cloning of cDNA fragments containing the complete IGS4 coding sequence. 

The complete coding sequence of IGS4 cDNA was obtained via both RACE analysis (rapid 

1 0 amplification of cDNA ends) and RT-PCR amplification. 5'- and 3' RACE PCR reactions were performed 
on Marathon-Ready™ cDNA from human brain or testis (Clontech n° 7400-1 and 7414-1 respectively), 
using the adaptor primers 1 and 2 (AP1: SEQ ID NO: 16 ; AP2: SEQ ID NO: 17) provided with the 
Marathon™ cDNA amplification kit (Clontech K1 802-1) and IGS4 specific primers. PCR RACE reactions 
were performed according to the instructions of the Marathon-Ready™ cDNA user manual provided by 

15 Clontech. RACE products were separated on agarose gel, visualized with ethidium bromide and blotted 
) onto Hybond N + membranes. Blots were prehybridized at 65°C for 2 h in modified Church buffer (0.5M 
phosphate, 7% SDS, 10 mM EDTA) and then hybridised overnight at 65°C in the same buffer containing 
2 x 10 5 cpm / ml of a 32 P-labelled IGS4 cDNA probe. IGS4 cDNA probes were radiolabeled via random 
primed incorporation of [a- 32 P]dCTP to a specific activity of > 10 9 cpm/ug using the Prime-It II kit™ 

20 (Stratagene) according to the instructions provided by the supplier. Hybridized blots were washed at 
high stringency (2 x 30 min at room temperature in 2 x SSC/ 0.1 % SDS, followed by 2 washes of 40 min 
at 65°C in 0.1 x SSC, 0.1% SDS) and autoradiographed overnight. Hybridizing fragments were purified 
from a preparative gel, cloned into the pGEM-T vector and sequenced as described above. 

An initial round of semi-nested 5' RACE analysis on human brain cDNA using the IGS4 specific 

25 primers IGS4R1 (SEQ ID NO: 18) and IGS4R2 (SEQ ID NO: 19)(designed on the DNA sequence of 
clone HNT1552) yielded clones HNT1886 and HNT1887 (Fig.1). These clones extended the IGS4 cDNA 
sequence upstream up to and beyond the putative start of translation codon. Likewise an initial round of 
3' RACE analysis on human brain cDNA using IGS4 specific primers IGS4F5 (SEQ ID NO: 20) and 
^ IGS4F6 (SEQ ID NO: 21) yielded clones HNT1874-1878 and HNT1902-1903 (Fig.1). These clones 

30 extended the known IGS4 cDNA at the 3' end. 

All sequences obtained at this point were assembled into a single contig which contained a long 
open reading frame, encoding part of a novel protein that was most similar to human orphan receptor 
FM-3 (Tan et al., Genomics 52, 223-229 [1998], GenBank accession n° AF044600 and AF044601). To 
investigate the RNA expression profile of IGS4, a Master Blot™ membrane (Clontech cat n° 7770-1) 

35 containing RNA from different human tissues was hybridized to the 3: P-labelled insert of clone HNT1903 
under the conditions recommenced by the supplier. The strongest hybridization was obtained with testis 
RNA whereas much weaker signals were obtained in prostate, stomach, spinal cord, hippocampus, 
meculla oblongata, thyroid gland, tnymus, lung and trachea. 

Since the cont:c sequence did not yet contain the complete IGS4 coding sequence we set up an 
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RT-PCR homology cloning e^^ment or, human total brain RNA using specific primer IGS4F6 

(SEG ID NO: 21) and a degenerated primer (R74, SEQ ID NO: 22), which was designed in a conserved 
area (at the TM7/C-terminal intracellular part) of the GPCR subfamily composed of the neurotensin 
receptors 1 and 2, the growth hormone secretagogue receptor {Howard A.D. et al.[1996] Science 273, 
5 S74-977) and the orphan GPCR FM-3 and GPR38 (McKee K.K. et al.[1997] Genomics 46, 426-434). 
RT-PCR reactions were carried out in a 50 pi volume on 500 ng total RNA from human brain using the 
Titan™ One Tube RT-PCR System (Boehringer catalogue n° 1,888,382) according to the 
recommendations of the supplier. Briefly, RT-PCR conditions were as follows: reverse transcription for 
45 min at 55'C; 2 min denaturation at 94°C, followed by a touch-down PCR reaction of 20 cycles (30 

1 0 sec denaturation at 94°C, 30 sec annealing at 60°C [-0.25°C/cycle3 and 2 min extension at 68°C) and an 
additional round of 30 PCR cycles (30 sec denaturation at 94°C, 30 sec annealing at 55°C and 3 min [+ 
5 sec/cycle] extension at 68°C). This was concluded with an extra extension step of 7 min at 68°C. 
Reaction products were analyzed via Southern blotting using the radiolabeled insert of clone HNT1903. 
A fragment of ± 690 bp that hybridized to the probe was purified from the gel (Qiaexll™, Qiagen) and 

15 cloned into the pGEM-T vector yielding clones HNT2210-2212. Sequence analysis of these clones 
^ allowed to extend the existing IGS4 cDNA contig in the 3' direction. 

Since the extended IGS4 cDNA contig still did not yet contain a translational stop codon, 
additional IGS4 specific 3' RACE primers were designed (IGS4F7-10, SEQ ID NO: 23-26)). Nested or 
semi-nested 3' RACE reactions were carried out on Marathon Ready™ cDNA from human testis 

20 (Clontech 7414-1). IGS4 specific bands (as assessed via Southern blot analysis using an IGS4 specific 
probe) were cloned into pGEM-T. This yielded clones HNT2289-90 (AP1/IGS4F5->AP2/1GS4F9), 
HNT2293-2295 (AP1/IGS4F6->AP2/IGS4F9), HNT2296-2297 (AP1/IGS4F7->AP2/IGS4F9), HNT2308- 
2310 (AP1/IGS4F8->AP2/IGS4F10) HNT2253 (AP1 /IGS4F7->AP1/IGS4F5). An additional 5' RACE 
PCR reaction carried out on testis Marathon Ready™ cDNA yielded clones HNT2279-2281 

25 (AP2/IGS4R6->AP2/IGS4R5). (note: AP1 /IGS4F5->AP2/IGS4F9 e.g. indicates that clones were 
generated from an IGS4 specific fragment obtained after the primary RACE PCR reaction [using primer 
pair AP1/IGS4F5] was nested with primer pair AP2/IGS4F9). 
^ Sequence analysis of these clones allowed to extend the existing 1GS4 cDNA contig further in the 3' 
™ direction although the end of the !GS4 coding sequence was not yet been reached. A computer- 

30 assisted homology search (Blastn; Altschut S.F. et al.. Nucleic Acids Res. (1997) 25:3389-3402) of the 
IGS4 contig DNA sequence against the expressed sequence tag (EST) database (dbest) showed the 
presence of an EST sequence (accession n° N45474) which overlapped with the 3' end of the IGS4 
contig (near 100 % identity in the overlap area). EST N45474 further extended the IGS4 DNA contig at 
the 3' end into a translational stop codon and into the 3' untranslated region (3'-UTR). In addition 

35 another set of ESTs was identified which all covered the 3'-UTR of the IGS4 mRNA {Fig. 2). Additional 
IGS^ soecific primers {IGS4R7-8, SEQ ID NO: 29-30)) were designed within the 3'-UTR of these ESTs. 
Primary PCR reactions were carried out on Marathon Ready™ cDNA from human testis using various 
combinations of the 1GS4F7 (SEQ ID NO: 23), IGS4F11 (SEQ ID NO: 31) and IGS4R7-8 (SEQ ID NC: 
29-30) primers. PCR tabes wer-e heated for 2 mm at G5 C C and then subjected to 35 cycles of 



34 DiR 3566 P 



~V annealinc (65 C C. 30 sec) and extension (7^^ 



denaturation (95°C, 30 sec), annealing (65 C C, 30 sec) and extension [Wm % 1 min 30 sec). Finally the 
reactions tubes were heated at 72 C C for 10 min. Nested PCR reactions were also carried out under the 
same conditions. DNA fragments of ± 1630 bp were purified from gel and cloned into the pGEM-T 
vector. The following clones were obtained: HNT2311, HNT2312 and HNT2317 (1GS4F7/IGS4R7- 
5 >iGS4F11/!GS4R8); HNT2313, HNT2324, HNT2326 and HNT2328 (IGS4F1 1->IGS4R8); HNT2314, 
HNT2315 and HNT2322 (IGS4F1 1 ->R7). Clone HNT2363 was obtained from a purified 1630 bp PCR 
fragment, that was amplified from human testis Marathon Ready™ cDNA using the IGS4F11/R7 primer 
pair under the following slightly modified conditions. After an initial denaturation of 2 min at 94°C, PCR 
tubes were subjected to 15 cycles of denaturation [15 sec, 94°C], annealing [30 sec, 65 C C] and 

10 extension [2 min, 72°C] followed by another 20 cycles of denaturation [15 sec, 94°C], annealing [30 sec, 
65°C] and extension [ 2 min, 72°C; + 10sec/cycle]. There was a final extension step of 7 min at 72°C. 
Sequence analysis of these clones allowed to assemble an IGS4 cDNA consensus sequence (Fig.1). 
Close inspection of all clones showed that they actually were of 2 sequence types, which differed at 5 
nucleotide positions. These variant sequences correspond to a polymorphism within the human 
^5 population. We refer to these different cDNA types as IGS4ADNA (SEQ ID NO: 1 and SEQ ID NO: 3) 
™ and IGS4BDNA (SEQ ID NO: 5 and SEQ ID NO: 7). The consensus sequence contained a long open 
reading frame that contained two in-frame start codons (positions 55-57 (SEQ ID NO: 1 and SEQ ID 
NO: 5) and 64-66 (SEQ ID NO: 3 and SEQ ID NO: 7) in IGS4ADNA and IGS4BDNA), predicting a 
protein of either 415 (SEQ ID NO: 2 and SEQ ID NO: 6) or 412 (SEQ ID NO: 4 and SEQ ID NO: 8) 

20 amino acids, which showed good homology to GPCR proteins. Hydropathy analysis (Kyte J. et ai.[1 982] 
J. Mol. Biol. 157: 105-132; Klein P.et al.[1985] Biochim. Biophys. Acta 815:468-476) of the protein also 
indicated the presence of 7 transmembrane domains. Since the first ATG initiator codon is within a weak 
"Kozak" translation initiation context and the second one is in a strong Kozak context, it is likely that the 
IGS4A/B protein starts at the second methionine and is 412 amino acids long (Kozak M. [1999] Gene 

25 234, 187-208). However some (or even exlusive) initiation at the first ATG cannot be excluded. Among 
the five polymorphic nucleotides, four (positions 947, 999, 1202 and 1216 in IGS4A/BDNA) resulted in a 
switch in the encoded amino acid residue, whereas the fifth (pos 1381 in IGS4A/BDNA) was within the 
^ 3'-UTR. The respective predicted protein sequences are referred to as IGS4APROT (SEQ ID NO: 2 and 
SEQ ID NO: 4) and IGS4BPROT (SEQ ID NO: 6 and SEQ ID NO: 8). (note 1: the sequence of 

30 IGS4APROT and IGS4BPROT in this document is represented as the longest possible (415 ammo 
acids) sequence but it is understood that the actual protein might be 3 amino acids shorter at the amino- 
terminus; for this reason the first 3 amino acids of IGS4APROT and IGS4BPROT in Table 4 and 5 have 
been bracketed) (note 2: In this document IGS4 refers to the IGS4 sequence in general, irrespective of 
the particular allelic type). Homology searches of the IGS4 protein sequence against public domain 

35 protein databanks showed best homology to the human orphan GPCR FM-3 (accession no C43664, 
Tan CP., et al. Genomics (1998) 52: 223-229; 46% identity in IGS4A amino acid residues 26-342). 



Homology searches of DNA databanks with the IGS4 cDNA sequence yielded a number of entries 
which were also ceri.ed from the IGS4 gene locus (Fig. 2 for overview): 
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3^^ession nrs W61169, AI432384, W61131^fe: 



. 10 EST sequence entries J^bssion nrs W61169, A1432384, W611 31 .W23570, F01358, F03770, 
Z38158, R40869, R37725, H11333), 2 STS (sequence tagged sites) (accession nrs G20615 and 
G05725) and one genomic sequence (accession nr AQ078563) were discovered which were all 
derived from the 3'-UTR of IGS4 cDNA. 
5 • EST accession n° N45474 encoded the 3' end of the IGS4 coding sequence and part of the 3* UTR 
(cfr supra). 

• A 'working draft' high throughput genomic sequence (accession nr AC008571, version AC008571.1, 
deposited 3 AUG 1999), which consisted of 42 unordered contigs assembled in an arbitrary order 
was discovered in which we detected the entire IGS4 cDNA sequence in 4 separate areas. These 
10 areas most likely correspond to the different IGS4 exons as they were flanked by canonical splice 

donor and acceptor sequences. On the basis of this analysis the position of the different exons in 
the IGS4ADNA (or IGS4BDNA) sequence can be defined as follows: exonl (1-780), exon 2 (781- 
865), exon 3 (866-991) and exon 4 (992-1658). The AC008571 genomic sequence is of the IGS4A 
allelic type. 

15 • 6 overlapping EST entries (accession nrs H11359, R13890, R13353, F07531, F05108, F05107) 
were discovered of which the assembled DNA sequence overlapped at its 3' end with IGS4 exon2 
and the beginning of exon 3. However the DNA sequence upstream of exon 2 was completely 
different from IGS4 exonl. Probably these six EST's are derived from transcripts which originated 
from an alternative promoter. 

20 • Finally 2 genomic sequence entries (accession nrs AQ019411 and AQ015065) were discovered 
which encoded exon 2. 

Among the many 1GS4 cDNA clones that we isolated in the different experiments described above, 
we also discovered a number of clones that contained a 64 bp deletion (pos 866-929 in IGS4ADNA) 
25 (besides a number of clones derived from unspliced [or partially spliced] transcripts). So far we only 
discovered truncated transcripts of the polymorphic type A. We refer to this splice variant cDNA 
sequence as IGS4A-64DNA (SEQ ID NO: 9 and SEQ ID NO: 11). Since this deletion occurs exactly at 
the exon 2/exon 3 boundary and since the last 2 nucleotides of the deleted fragment are "AG", it is likely 
that this deletion represents an alternative splicing event in which the "AG" within exon 3 served as a 
fco splice acceptor. The IGS4A reading frame encoded by the splice variant is frameshifted beyond the 
deletion point. The encoded (truncated) protein of 296 amino acids is referred to as IGS4A-64PROT 
(SEQ ID NO: 10 and SEQ ID NO: 12). Hydropathy analysis of the IGS4A-64PROT sequence shows that 
this protein only contains 5 transmembrane domains (corresponding to TM domains 1-5 of 
IGS4APROT). This truncated receptor might have physiological relevance. 

35 

The bacterial strain harboring plasmid HNT2322 (containing the 1GS4ADNA insert) was 
recloned after replating on LB agar plates containing 100 ug ampiciilin/ml and deposited both in the 
Innogenetics N.V. strain list (ICCG4320) and at the "Centraalbureau voor Schimmeiculturen (CBS)" in 
Baarn. The Netherlands (accession n ; CBS102221). Plasmid DNA was prepared from the ^ec ! oned 
40 isolate and the insert was resecuenced and found to be identical to the IGS4ADNA sequence. 

The bacterial strain harboring plasmid HNT23G3 (containing the IGS4BDNA insert) was 
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recionec after replating on LE^acar plates containing 100 ug ampiciliin^fcnd ceposited both in the 
Innocenetics N.V. strain list (1CCG4340) and at the "Centraaibureau voor Schimmelculturen (CBS)" in 
Eaam, The Netherlands (accession n c CBS1 02222). Plasmic DNA was prepared from the recioned 
isolate and the insert was resequenced and found to be identical to the IGS4BDNA sequence. 



10 



All publications, including but not limited to patents and patent applications, cited in this specification are 
herein incorporated by reference as if each individual publication were specifically and individually 
indicated to be incorporated by reference herein as though fully set forth. 

The figures show: 



Fig.1 Schematic representation of the relative positions of the different cDNA clones that were 
isolated to generate the consensus IGS4 cDNA sequence. 5* and 3' RACE primers that were 
^5 used are also indicated (IGS4R# and !GS4F# respectively) as well as the position of EST 

™ accession n° N45474. Primer IGS4R6 was located within intron 1 . Some clones (e.g. HNT231 1 , 

HNT2312 and HNT2253) were only partially sequenced (only the part that was sequenced is 
indicated). CONSENSUS A and CONSENSUS B denote the consensus sequence of IGS4 
allelic types A and B respectively. The nucleotide that was identified at each of the 4 
20 polymorphic positions is indicated (shaded boxes) for each clone. "S" indicates a sequence 

ambiguity in clones HNT2211 and HNT2212 and means either X" or "T\ The coding area of 
IGS4A and IGS4B consensus sequences is indicated with As there were some remaining 
sequence ambiguities in the 5* end of the consensus sequence, the IGS4ADNA and IGS4BDNA 
sequences have only been taken from position 86 until the end 

25 

Fig.2 Schematic representation of the relative positions of different DNA database entries compared 
to the IGS4 cDNA sequence. The IGS4 cDNA sequence is indicated with the boxes (the 
^ position of the IGS4 coding sequences is indicated with the filled boxes). The relative position of 

IGS4 exons 1-4 is indicated above the IGS4 cDNA sequence ("=="). The parts of the genomic 
30 sequence AC008571 that encode exons 1->4 are indicated with AC008571a->d respectively. 

The position of these fragments within the AC008571 sequence are: AC008571a (13129-13908 
of the reverse complement of AC008571), AC008571b (51676-51760 of AC008571), 
AC008571C (79978-80103 of the reverse complement of AC008571) and AC008571d (83060- 
83728 of the reverse complement of AC008571). G05725 and G20615 are STS (sequence 
3- tagged secuence) entries whereas F05107, F05108, F07531, R13353, R13890, H11359, 

N45474, W61169. AI432384. W61131. AI023570, F01358, F03770. Z38158. R40869. R37725. 
H1 1333 a r e EST entries. "The parts of genomic clones AQ019411 and AQ015065 that contain 
1GS4 exon 2 are indicated with The 5' part of EST sequences F05107. F05108, F07531. 



37 D1R 0566 P 



R13353, R13S90 ar^fcl35S which is totally different from t^^S4 cDNA sequence is 
indicated with "*". AQ078563 is a genomic clone. 
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SEQUENCE LISTING 

< 1 1 C > Duphar Interna: icnal Research BV 

<120> Novel human G-prctein coupled receptor 

<120> DIR 566 

<140> 
<241> 

<16 0> 31 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4A Long version 

<400> 1 

agctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 

1 

tea ggg atg gaa aaa etc cag aat get tec tgg ate tac cag cag aaa 
Ser Gly Met Giu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
5 10 15 

eta gaa gat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cct egg cgc age cac ttc ttc etc cec gtg tct 
Ala Phe Leu Cys Gly Pre Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat etc ctg 
Val Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu 
5 0 55 60 65 

g^g tge ctg erg att: ctg cag cac cag get atg aag acg cec acc aac 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pre Thr Asn 
70 75 SO 

tac tae etc etc age cz~ gcg g7e tcr gae etc ctg gtc etc etc ctt 
T/r Tvr Leu Phe Ser Leu Ala Val Ser Aso Leu Leu Val Leu Leu Leu 



105 



153 



201 



40 
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gga atg ccc ccg gag etc car gag a:g tgg cgc aa: tac ccc t:: ttg 393 
Gly Her Pre Leu C-lu Val Tyr Glu Met Trp Arg .-.sr. Tyr Pre Phe Leu 
IOC ICS no 



5 rrc ggg ccc org ggc rgc rac ttc aag acg gec crc ttt gag acc grg 441 

Fhe Gly Frc Val Gly Gys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc rrc ccc rcc arc crc age arc acc acc grc age gtg gag cgc tac 483 

10 Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 

130 135 140 145 

gtg gec ate eta cac ccg ttc cgc gee aaa ctg cag age acc egg cgc 537 

Val Ala He Leu Kis Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

15 150 155 160 

egg gee etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

20 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 633 

Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 ISO 

25 aat ggg tec ctg gtc cca ggt teg gee acc tgt acg gtc ate aag ccc 681 

Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac 729 

30 Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 

210 215 220 225 

etc etc ccc acg act grc ate agt grc etc tac tac etc atg gca etc 777 

Leu Leu Frc Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 

35 230 235 240 

aga eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat 825 

Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

40 

art caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg 873 

He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 
260 265 270 

45 gtc tta gtg ttt get arc tgt tgg gee ccg ttc cac att gac cca etc 921 

Val Leu Val Fhe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 
275 230 2B5 

ttc rrc age rrr gtg gag gag tgg agr gaa tec ctg get get grg etc 963 

5C Fhe Phe Ser Fhe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Fhe 

2 9C 21-1 2ZZ - - 

aac err gee car gtg g:: tea ggr gre tire rrr rac erg age rea ger 

Asr. Leu Val His Val Val Ser Glv Val rhe Pre Tvr leu Ser Ser Ai a 

55 -10 is n: 
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1113 



1161 



1209 



etc aac ccc att ate tat aac eta ctg tct cgc cgc ttc cag gca gca 1C65 
Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 
325 330 335 

tec cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat 
Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac cca cag ttg cca cct gee cag egg aac ate ttc ctg aca gaa tgc 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc cca tgt cag 
His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys Gin 
370 375 380 385 

tea tec atg cac aac tct cac etc cca aca gee etc tct agt gaa cag 1257 
Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 
390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa acc 1299 
Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgcc ttctcatatg atattagaga ggtagaatgg ctcttacaac tcatgtaccc 14 19 

actgetagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 14^5 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 1539 

tagctcatgc caagaaegtg actggaaagg catggcacct ataccttgat taatttccat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 



<210> 2 
<211> 415 
<212> PRT 

<213> Homo sapiens 
<4 00 > 2 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin 
1 ' 5 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
2 0 2 5 3 0 
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Asn Tyr Tyr leu ?he Ser Leu Ala Val 5er Asp Leu Leu Val Leu Leu 
S5 SO 55 

Leu Gly Mei Pre Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Fro ?he 
1C3 105 no 

Leu Phe Gly Pre Val Gly Cys Tyr Phe Lys Thr Ala Leu Fhe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arc Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 
165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
ISO 185 190 

Pro Asn Gly Ser Leu Val Fro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pre Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Fhe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 

225 220 235 240 

Leu Arc Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 
245 250 255 

Asn He Gin Arg Fro Cys .Arc Lys Ser Val Asn Lys Met Leu Phe Val 
25 0 265 270 

Leu Val Leu Val Fhe Ala He Cys Trp Ala Fro Fhe His He Asp Arg 
2^5 280 285 

Leu Phe Fhe Ser Fhe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 
2 30 295 300 
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His Asp Pro Gin Leu Fro Pro Ala Gin Arc Asn He Phe Leu Thr Glu 
355 35 0 365 

Cys His Phe Val Glu Leu Thr Giu Asp He Gly Pre Gin Phe Pre Cys 
5 37 0 375 330 

Gin Ser Ser Met His Asn Ser His Leu Pre Thr Ala Leu Ser Ser Glu 

355 390 395 400 

10 Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



15 <210> 3 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

20 <220> 

<2 2 1> CDS 

<222> (64) . . (1299) 

<223> IGS4A short version 



25 



156 



204 



<400> 3 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 

30 Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 

15 10 15 

gaa gat cca etc cag aaa cac ctg aac age acc gag gag tat ctg gec 

Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

35 20 25 30 

ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tct gtg 

Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 

3 5 4 0 4 5 

40 

gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 

Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu Val 

50 55 60 

45 tgc ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac tac 3 C 2 

Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 

6 5 7 0 7 5 

tac etc ttc age ctg gcg gtc tct cac etc ctg gtc ctg etc ctt gga 345 

50 Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 



Met 
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:tc ccc arc etc age ate acc acc grc ace etc gag cgc tac gtg 492 

Phe Ala Ser lie Leu Ser He Thr Thr Val Ser Val Glu Arc Tyr Val 
130 135 140 

10 gec ate eta cac ccg ttc cgc gec aaa ctg cag age acc egg cgc egg E4C 

Ala He Leu His Fro ?he Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gec etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc tec 5S8 

15 Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 

160 165 170 175 

ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc aat 636 

Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn 
20 180 185 130 

ggg tec ctg gtc cca get teg gec acc tgt acg gtc ate aag ccc atg 684 

Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met 
195 200 205 

25 

tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac etc 732 

Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 

210 215 220 

30 etc ccc atg act etc ate agt gec etc tac tac etc atg gca etc aga "50 

Leu Pre Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tet ctt gag gca gat gaa ggg aat gca aat att 228 

35 Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 

240 245 250 255 

caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 

Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 
40 260 265 2-70 

tta gtg ttt get ate tgt tgg gec ccg ttc cac att gac cga etc ttc 524 
Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe 
2^5 280 235 



45 



50 



ttc age ttt gtg gag gag tgg agt caa tec ctg get get gtg ttc aac 
?ne Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe Asn 

2 9 5 5 0 0 



• c - 



20 



30 



45 DIR 0566P 



:■ 6 s 



aac ccc att ate tat aac eta etc tet cgc cgc tte cag gca gca ttc 

Asn Pro He He Tyr Asn Leu Leu £er Arg Arg Fhe Gin Ala Ala Phe 
320 325 330 335 

5 cag aat gtg ate tet tet tte cac aaa cag tgg cac tec cag cat gac 

Gin Asn Val He Ser Ser Phe Kis Lys Gin Trp His Ser Gin His Asp 
3 40 345 350 

cca cag ttg cca cct gee cag egg aac ate ttc ctg aca gaa tgc cac 

10 Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His 

355 360 365 

ttt gtg gag ctg ace gaa gat ata ggt ccc caa ttc cca tgt cag tea 1212 

Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys Gin Ser 

15 370 375 380 



1116 



1164 



tec atg cac aac tet cac etc cca aca gec etc tet agt gaa cag atg 

Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 

385 390 395 

tea aga aca aac tat caa age ttc cac ttt aac aaa acc tgaattcttt 

Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

400 405 410 



1260 



1309 



25 cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 
ttctcatatg atattagaga ggtagaatgg ctcttacaac tcatgtaccc attgetagtt 1429 
tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 14B9 
ctgattttta gttatctttc eactatccta actgcctcat gccccttcac tagttcatgc 1549 
caagaaegtg aetggaaagg catggcacct ataccttgat taatttccat taatggaaat 16C9 
35 ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

<210> 4 

<211> 412 

40 <212> PET 

<213> Homo sapiens 

<400> 4 

Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin Lys Leu Glu 
45 1 5 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 

20 25 30 

50 Leu Gvs Glv Pro Ara Arc Ser His Fhe Phe Leu Pro- Val Ser Val Val 



C 3 



5o 
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Leu Fhe Ser Leu -la Val Ser Asn leu leu Val leu leu leu Gly Met 
5 65 50 55 

Fro leu Glu Val Tyr Glu Met Trp Arg Asr. Tyr Fro Phe Leu ?he Gly 
ICO 105 i 1 0 

10 Fro Val Gly Cys Tyr Phe Lys Thr Ala leu Phe Glu Thr Val Cys Fhe 
115 12C 125 

Ala Ser He leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 



15 



30 



45 



He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arc Arg Ala 
145 150 155 160 



Leu Arg lie Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

20 165 170 175 

Fro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pre Asn Gly 

180 185 190 

25 Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 

195 200 205 



He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 

2 10 215 220 

Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 220 235 24 0 



Lvs Lvs As d Lvs Ser leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 

35 245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val Leu 
260 265 27 0 

40 Val Fhe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe Phe 

2 ~ 5 28 0 285 



Ser Fhe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Fhe Asn leu 

250 295 300 

Val His Val Val Ser Gly Val Fhe Fhe Tyr leu Ser Ser Ala Val Asn 

2C5 3 1 C 3 15 3 2 0 



Pro He He rvr Asn leu Leu Ser Arg Ara Fhe Gin Ala Ala Phe Gin 
50 H5 2 2 0 3 3 5 
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Gin Leu Fro Pre Ala Glr. Arg Asn lie Fhe Leu Thr Glu Cys Kis Phe 

255 36C 365 

Val Glu Leu Thr Glu Asp lie C-ly Pro Gin Phe Pro Cys Gin Ser Ser 

370 375 380 

Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met Ser 

385 390 395 400 



Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 



<210> 5 
<211> 1658 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . - (1299) 

<223> IGS4B long version 

<400> 5 

qqetcagett gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 

Met 
1 

tea ggg atg gaa aaa etc cag aat get tec tgg ate tac cag cag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
5 ^ 10 15 

eta gaa gat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
2 0 2 5 3 0 

gee ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tct 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
3 5 4 0 4 5 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu 
50 55 60 65 

gtg tgc ctg gtg att ctg cag cac cag get atg aag acg ccc ace aac 297 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 
70 75 80 

tac tac etc ccc age ccg gcg gtc tct gac etc ctg gtc ctg etc ctt 345 
Tvr Tvr Leu Phe Ser Leu Ala Val Ser Ase Leu Leu Val Leu Leu Leu 
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gga atg ccc etc gag etc tat gag atg tgg cgc aac tac cct ttc ttg 393 

Gly Met Pre Leu Glu Val Tyr Glu Met Trp Arg ;.sr: Tyr Fro Phe Leu 

1GC 105 110 

ttc ggg ccc gtg gee tgc tac ttc aac acg gec etc ttt gag ace gtg 441 

Fhe Gly Pre "a I Gly Cys Tyr Phe Lys Thr Ala Leu ?he Glu Thr Val 
115 120 125 

tgc ttc ccc tec ate etc age ate aec ace etc age gtg gag cgc tac 459 

Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 
13 0 13 5 14 0 14 5 

gtg gee ate eta cac ccg ttc cgc gee aaa ctg cag age aec egg cgc 537 

Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 
150 155 160 

egg gee etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 633 

Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro 

180 185 190 

aat ggg tec ctg gtc cca ggt teg gec acc tgt acg gtc ate aag ccc 681 

Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 

195 200 205 

a ~9 tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac "29 

Met Trp lie Tyr Asn Phe lie He Gin Val Thr Ser Phe Leu Phe Tyr 
22 0 2 15 220 225 

etc etc ccc atg act gtc ate act etc etc tac tac etc atg cca etc 77" 

Leu Leu Pre Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 
230 235 240 

aga eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat 825 

Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga eee tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg B73 

lie Gin Arc Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 

26C 265 270 

ere cca etc cct get ate tgt tgg gec ccg ttc cac att gac cga etc 921 

Val Leu Val Phe Ala He Cys Trp Ala Fro Phe His lie Asp Arg Leu 
2^5 28 0 285 

tec ttc age ttt ctg gag cac tgg act gaa tec ctg get get ctg ttc 969 

Pee Phe Ser He Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Pre 

290 z 3 5 300 3 0 5 



4S DIR 0566P 



etc aac ccc att ate tat aac eta ctg tct cgc cgc ttc cag gca gca 
Val Asn Pro He He Tyr Asn Leu Leu Ser Arc Aurg Fhe Gin Ala Ala 



325 330 

ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat 1113 
Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac cca cag ttg cca cct gee cag egg aac ate ttc ctg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag ctg ace gaa gat ata ggt ccc caa ttc eta tgt cag 1209 
His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin 
370 375 ' 380 385 

tea tec gtg cac aac tct cac etc cca aca gee etc tct agt gaa cag 1257 
Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 
390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa ace 1299 
Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgcc ttetcatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc 1419 

attgetagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 1539 

tagttcatgc caagaaegtg actggaaagg catggcacct ataccttgat taatttccat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 



<210> 6 
<211> 4 15 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin 
15 10 15 



Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
- 5 3 0 



50 
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Leu Val Cvs leu Val lie Leu Gin His Gin Ala Met Lys Ihr Pre .hr 
53 " t: "5 b: 

Ash Tyr Tyr leu Phe Ser leu Ala Val Ser Asp Leu Leu Val Leu Leu 
65 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asa Tyr Pro Fhe 
100 1C5 11C 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 



Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 
15 130 135 140 

Tvr Val Ala He Leu His Pro Fhe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

20 Arg Aurg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 

165 170 175 



25 



Phe Ser Leu Pro Asn Thr Ser lie His Gly He Lys Phe His Tyr Phe 

180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 

195 200 205 



Pro Met Trp lie Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 

30 210 215 220 

Tyr Leu Leu Pro Me: Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 

225 23 0 235 24 0 

35 Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 



40 



Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
2~5 280 265 



Leu Phe Phe Ser Fhe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val 
45 290 295 300 

1ZS 3i: " 315 320 

50 Ala Val Asn Pre He lie Tvr Asn Leu Leu Ser .Arg Arg Phe Gin Ala 

225 330 215 
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m # 



His Asp Pre Gin Leu Fro Pre Ala Gin Arg Asn He Fhe Leu Thr Glu 
355 360 365 

Cvs His Fhe Vai Glu Leu Thr Glu Asp lie Gly Pro Gin Fhe Leu Cys 
370 375 380 

Gin Ser Ser Vai His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
335 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 



<210> 7 
<211> 1658 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4B short version 

<400> 7 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 
Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin Lys Leu 
I 5 10 15 

gaa gat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg gee 
Glu Asp Pro Phe Gin Lvs His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tct gtg 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Vai Ser Vai 
35 40 45 

gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Vai Tyr Vai Pro lie Phe Vai Vai Gly Vai He Gly Asn Vai Leu Vai 
50 55 60 

tgc ctg gtg ate ctg cag cac cag get atg aag acg ccc ace aac tac 3C0 
Cys Leu Vai lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gac etc ctg gtc ctg ccc ctt gga 248 
Tyr Leu Fhe Ser Leu Ala Vai Ser Asp Leu Leu Vai Leu Leu Leu Gly 

£0 8 5 9 0 9 5 



60 
108 

156 

204 
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5 etc c:c tec acc etc age ate acc acc gtc age gtg gag cgc tac etc 
?he Ala Ser lie leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
13 C 13 5 14 0 

gec ate eta cae ccg tte cgc gee aaa ctg cag age acc egg cgc egg 
10 Ala He leu His Fro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gee etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc tte tec 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
15 160 165 170 175 

ctg ccc aac acc age ate cat ggc ate aag ttc cae tac ttc ccc aat 
Leu Pro Asn Thr Ser He Kis Gly He Lys Phe His Tyr Phe Pro Asn 
180 185 190 



20 



ggg tec ctg gtc cca ggt teg gee acc tgt acg gtc ate aag ccc atg 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met 
195 200 205 



540 



588 



636 



684 



25 tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac etc 732 

Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 

30 Leu Pro Ket Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 

225 230 235 

eta aag aaa gsc aaa tct ctt gag gca gat gaa ggg aat gca aat att 528 

Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
35 240 245 25C 255 

caa aca ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 

Gin Arg Pre Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 
260 " 265 270 

40 

tta gtc ttt get ate tgt tgg gee ccg ttc cac att gac cga etc ttc S24 

Leu Val Phe Ala He Cys Trp Ala Pro Phe Kis He Asp Arg Leu Phe 
2"5 280 285 

45 ttc age ttt ctg gag gag tgg act gaa tec ctg get get gtg ttc aac 972 

Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn 
2? 2 2S5 300 

etc gtc cat erg gtg tea ggt gtc tta ttc tac ctg age tea get etc 112 
50 Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val 
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25 



1164 



1212 



1260 



1309 



cag aat gtg acc tct tct ttc cac aaa cag tgg cac tec cag cat gac 

Gin Asn Val He Ser Ser ?he His Lys Gin Trp His Ser Gin His Asp 
340 345 350 

5 

cca cag ttg cca cct gec cag egg aac ate ttc ctg aca gaa tgc cac 

Fro Gin Leu ?rc Pro Ala Gin Arg Asn lie Fhe Leu Thr Giu Cys His 
355 360 365 

10 ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc eta tgt cag tea 
Fhe Val Giu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin Ser 
37 0 375 380 

tec gtg cac aac tct cac etc cca aca gec etc tct agt gaa cag atg 
15 Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aca aac tat caa age ttc cac ttt aac aaa acc tgaattcttt 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
20 4 00 4 05 410 

cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 

ttctcatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc attgetagtt 1429 

tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 1489 

ccgattttta gttatctttc cactatccta actgcctcat gccccttcac tagttcatgc 1549 

30 caagaaegtg aceggaaagg satggcacct ataccttgat taatttccat taatggaaat 1609 

ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

35 <21C> 8 

<211> 412 
<212> PRT 

<2 13> Hcrr.c sapiens 
40 <4CC> 8 

Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Fhe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
45 2 0 2 5 3 0 

Leu Cys Gly Pre Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

50 Tyr Val Fro He Fhe Val Val Gly Val He Gly Asn Val Leu Val Cys 

- 0 n : 3 6 0 
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Leu Phe Ser leu Ala Val ^er Asp Leu Leu Val Leu Leu Leu Gly Met 
5 5 3 0 3 3 

Pre Leu Glu Val Tyr Glu Me: Trp Arc Asr. Tyr Pre Phe Leu Phe Gly 

5 ICC 105 no 

Pre Val Gly Gys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

10 Ala Ser lie Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
13 C 13 5 14 0 

He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 

145 150 155 160 



15 



30 



45 



50 



Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser Leu 
165 170 175 



Pro Asr. Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn Gly 

20 180 185 130 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 
195 200 205 

25 He Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 

210 215 220 



Fro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 

225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 

245 250 255 



Arg Pre Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val Leu 

35 130 " 265 2"C 

Val Phe Ala lie Cys Trp Ala Pro Phe His He Asp Arg Leu Phe Phe 

2^5 280 235 

40 Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn Leu 

23C 2S5 300 



Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val Asn 

305 310 315 220 

Pre He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe Gin 

315 320 32 5 
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Val Glu Leu Thr Glu Asp He Gly Pro Gin Fhe Leu Cys Gin Ser Ser 

370 375 330 

Val His Asn Ser His Leu Fro Thr Ala Leu Ser Ser Glu Gin Met Ser 
385 3S0 395 400 

Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 



<210> 9 

<211> 1594 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (942) 

<223> IGS4A truncated DNA long version 
<400> 9 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin Lys 
' S 10 15 

eta gaa cat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 



ace ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tct 

Ala Phe Leu Cys Gly Pre Arg Arg Ser His Phe Phe Leu Pro Val Ser 

35 40 45 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat etc ctg 

Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu 

5 0 53 60 65 

gtg tgc ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac 

Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

7 0 7 5 8 0 

tac tac etc t:c age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt 

Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 

£5 50 5 5 



201 



249 
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::c ggg :cc gtg ggc tec tac ttc aac ace gee etc ttr gag ate etc 

Pee Gly Pre Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 

115 120 125 

tgc ttc gee tee ate ete age ate aec acc gte age gtg gag egc tae 

Cys ?he Ala Ser He Lee Ser He Thr Thr Val Ser Val Glu Arg Tyr 

130 135 140 145 



gtg gee ate eta cac ceg ttc cgc gee aaa ctg cag age acc egg egc 537 

Val Ala He Leu His Fro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 
15 0 15 5 16 0 

egg gee etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 633 

Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 

aat ggg tec ctg gtc cca ggt teg gee acc tgt acg gtc ate aag ccc 681 

Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 

195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac 729 

Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Fhe Leu Phe Tyr 

210 215 220 225 

etc etc ccc atg act gte ate agt gte etc tac tac etc atg gea etc 777 

Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 
230 235 240 

aga eta aag aaa gac aaa tct ctt gag gea gat gaa ggg aat gea aat 825 

Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg tct ttg tgg 873 

He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp 
260 265 270 



agg agt gga gtg aat eee tgg ctg ctg tgt tea acc teg tec atg tgg 
Arg Ser Gly Val Asn Pre Trp Leu Leu Cys Ser Thr Ser Ser Met Trp 
275 280 285 



tgt cag gtg tct tct tct acc tgagcteagc tgtcaaccec attatctata 

Cys Gin Val Ser Ser Ser Thr 
290 295 



aeetaetgtc tegtegctte caggcagcat teeagaatgt gatctcttct tteeacaaae 1132 

agtggeaetc ceageatgae teacagttge eacctgeeea geggaaeate ttcetgaeag 1Z92 

astgecaett ~g~ggagctg aeegaagata raggtcceea =:t::ca:g: cagteateea 1151 

tgcacaaetc teatctccca acageectct ctagtgaaea gatgtcaaga ataaaetait 3212 



15 



25 



40 
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aaagcttcca ctttaacaaa acctgaattc tttcagagct gactctcctc ratgcctcaa 1272 

aacttcacac aggaacatcc ca:aa:gtat gccttctcat atgatattag agaggtagaa 13.* 2 

5 

tggctcttac aactcatgta cccattgcta gttttttttt tttaataaac gtgaaaactg l^Sz 

agagttagat ctggtttcaa aacccaagac tgcctgattt ttagttatct ttccactatc 1452 

10 ctaacrgcct catgcccctt cactagttca tgccaagaac gtgactggaa aggcatggca 1512 

cctatacctt gattaatttc cattaatgga aatggttcgt cctgagtcat ctacgttccg 1572 



15 94 

agtcaggctg tcactcctac ta 



<210> 10 

<211> 296 

<212> PRT 

20 <213> Homo sapiens 

<400> 10 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
1 ' 5 10 15 



Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 

2 0 2 5 3 0 



Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 

30 35 40 4 5 

Ser Val Val Tyr Val Pre lie Phe Val Val Gly Val lie Gly Asn Val 

50 55 60 

35 Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Fro Thr 

65 70 75 80 



Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pre Leu Glu Val Tyr Glu Met Trp Arg Asn T/r Pro Phe 

i:o 105 no 



Leu Phe Gly Pre Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 

45 115 120 125 

Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 

130 135 140 

50 Tyr Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 

14 5 15C 155 150 



L rg Arg Ala Leu Arg lie Leu Gly -le Val Trp Gly .- re Ser '. ai Leu 
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?:.e Ser Leu Pre Asn Thr Ser He His Gly He Lys Phe His .yr F:.e 
1 5 C 15 5 I? 

Pre Asn Gly Ser Leu Val Pre Gly Ser Ala Thr Cys Thr Val He Lys 
155 2 00 205 

Pre Me: Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
21C 215 22C 

Tyr Leu Leu Pre Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 250 235 240 

Leu Arc Leu Lys Lys A.sp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 
245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu 
260 265 270 

Trp Arg Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met 
275 280 285 

Trp Cys Gin Val Ser Ser Ser Thr 
290 295 



< 2 1 0 > 11 
<211> 15S4 
<212> DNA 

<215> Komo sapiens 

<22C> 

<221> CLS 

<2 22 > ( £ 4 ) . . • 542 ; 

<22 3 > IGS4AH4 snort version 

< 4 C 0 > 11 

cgctcagctt gaaacagagc otcgtaccag gggaggctca ggccttggat tttaatgtca 60 



egg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 

Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin Lys Leu 

5 10 15 

gaa gat cea ttc cag aaa cac cig aac age ace gag gag tat ctg gee 

Glu Asp Pre Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 ^3 

in etc ige gga eel egg cge age cac lie tie eic cce gig tet gig 

Phe Leu Cys Gly Pro Arg Arg Ser His Phe Pee Leu Pro Val Ser Val 

3 5 4 0 4 5 

gig 1 = 1 gig cea an in gig gig ggg g: e r, 1 1 ggc aat gn ctg gig 

Val Tvr Val Pr i He Phe VH G"v Val Tie 3 1 v Asn Val Leu VH 



108 



15 6 
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tec cts etc att ctg cag cac cag gc: 



ccc acc aac tac 2C0 



Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Frc Thr Asn Tyr 
65 7C 75 

tac etc ttc age ctg ccg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 396 
Met Pre Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 
100 105 HO 

ggg ccc gtg ggc tgc cac ttc aag acg gec etc ttt gag acc gtg tgc 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

ttc gee tec ate etc age ate acc acc gtc age gtg gag cgc tac gtg 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gee ate eta cac ccg ttc cgc gee aaa ctg cag age acc egg cgc egg 
Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gec etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc tec 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc aat 
Leu Pro Asn Thr Ser lie His Gly He Lys Phe His Tyr Phe Pro Asn 
ISO 185 190 

ggg tec ctg ere cca ggt teg gec acc tgt acg gtc ate aag ccc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac etc 732 
Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 
Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 

225 230 235 



eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat att 
Leu Lys Lvs Asn Lvs Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
' 74= 25C 255 



444 



492 



540 



588 



636 



828 
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15 



25 



40 



CSC C t C 1 C t t C t t C t c"C I-aCCt-SCC tgt Cc^aCCCC atw£.~C_ci_c 

Gin Val Ser Ser Ser Xr.r 



acctactctc t;occcct"c cacccaccat tccacaatcc gatctcttct ttccc 
actgccactc ccagcatgac ccacagttgc cacctgccca gcggaacatc ttcctgacag 1CS2 
10 aatgccactt tgtggagctg accgaagata taggtcccca attcccatgt cagtcatcca 1152 
tgcacaactc tcacctccca acagccctct ctagtgaaca gatgtcaaga acaaactatc 1212 
aaagcttcca ctttaacaaa acctgaattc tttcagagct gactctcctc tatgcctcaa 1272 
aacttcagag aggaacatcc cataatgtat gccttctcat atgatattag agaggtagaa 1332 
tggctcttac aactcatgta cccattgcta gttttttttt tttaataaac gtgaaaactg 1392 
20 agagttagat ctggtttcaa aacccaagac tgcctgattt ttagttatct ttccactatc 1452 
ctaactgcct catgcccctt cactagttca tgccaagaac gtgactggaa aggcatggca 1512 
cctatacctt gattaatttc cattaatgga aatggttcgt cctgagtcat ctacgttccg 1572 
agtcaggctg tcactcctac ta 



<210> 12 
30 <211> 293 
<212> PRT 

<213> Homo sapiens 
<400> 12 

35 Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Fhe Gin Lys Kis Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 



Leu Cvs Glv Pro Arg Arg Ser Kis Phe Phe Leu Pro Val Ser Val Val 
35 " " 40 45 



Tyr Val Pre He Fhe Val Val Gly Val He Gly Asn Val Leu Val Cys 
45 50 55 60 



Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn lyr Tyr 
65 -: 75 so 

50 Leu Phe Ser Leu Ala Val Ser Aso Leu Leu Val Leu Leu Leu Gly Met: 
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; rc val Gly Cys Tyr^he Lys Thr Ala leu Fhe Glu Thr Val Cys Fhe 
115 HO 125 

Ala Ser lie Leu Ser He Thr Thr Val Ser Val Glu Arc Tyr Val Ala 

130 135 140 

He Leu Kis Pre Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg He Leu Gly He Val Trp Gly Fhe Ser Val Leu Phe Ser Leu 
165 170 175 

Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 
195 200 205 

He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 

210 215 220 

Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 



225 



230 235 240 



Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 
24 5 2 50 2 55 

Arg Pro Cys Arc Lvs Ser Val Asn Lys Met Leu Ser Leu Trp Arg Ser 
260 265 270 

Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp Cys Gin 
275 280 285 

Val Ser Ser Ser Thr 
290 



<210> 13 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 
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<4CG> 13 

c::acc"cg cggcgggcrc ngyngg 



5 <21C> 14 
< 2 1 1 > 31 
<212> DNA 

<213> Artificial Sequence 
10 <22C> 

<223> Description cf Artificial Sequence: Degenerated 
primers 

<220> 

15 <2 21> variation 
<222> (22) 
<223> C or Inosine 

<220> 

20 <221> variation 
<222> (25) 
<223> A, C , G or T 

<220> 

25 <221> variation 

<222> (28) 

<223 > A, C, G or T 

<4CC> 14 

30 ggccaggcag cgctccgcgc tnarncyngc d 



<21C> 15 

<211> 2 2 

35 <2 12> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Degenerated 
40 primers 

<4C0> 15 

gaartartag ccrcgrcagc cw 

45 

<210> 16 
<211> 27 

<2i2> d:;a 

<213> Artificial Sequence 

50 

<22C > 

<223 > Description of Artificial Sequence: Primer 
<4C0> 16 

55 ccatcctaat scgactcact atagg^c 




< 2 1 0 > 1 7 

<211> 2 3 

<2I2> DNA 

<213> Artificial Secruence 

5 

<22C> 

<223> Description of Artificial 

<4C0> 17 
10 actcactata gggctcgagc ggc 



<210> 13 

<211> 27 

15 <2 12> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 

20 

<400> 18 

ggatcccaaa taagaaaggg tagttgc 



25 <210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> Description of Artificial 

<400> 19 

aaaccatact tacgccacat cocatagac 

35 

<210> 20 

<211> 29 

<212> DNA 

40 <213> Artificial Sequence 

<220 > 

<223> Description of Artificial 

45 <40C> 20 

aggtctatga gatgtggccc aactaccct 
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Sequence : Primer 

23 

Sequence: Primer 

27 

Sequence : Primer 

2 9 

Sequence : Primer 
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10 



20 



25 



30 



<^1Z> 21 

< 2 1 1 > 2 3 
<212> 2NA 

<2 12> Artificial 2 e qu e:.:e 

< 2 2 0 > 

<223> Description cf Artificial Sequence: Prime: 
<4CC> 21 

acgtggcgca accacccctt cttatttggg 



<2 1 C > 22 
<211> 25 
15 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Degenerated 
primers 

<400> 22 

cggaagttgg cggacacgr/ rttrta 26 



<210> 23 

<211> 27 

<212> DI7A 

<213> Artificial Sequence 

< 2 2 0 > 

<223 > Description of Artificial Sequence: Primer 

<4G0> 23 

gctcagctcg aaacagagcc tcgtacc 



<210> 24 

<2 11> 27 

40 <212> D:7A 

<2 13> Artificial Sequence 

< 2 2 0 > 

<223> Description cf Artificial Sequence: Primer 



45 



< 4 C C > 24 
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< 2 1 C > 2 3 
<211> 23 

< 2 1 2 > DNA 

<2 13> Artificial Sequence 
<22C> 

<222> Description of Artificial Sequence: Primer 
<4CC> 25 

aagacaaatc tcttgaggca gatgaaggg 



<210> 26 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

gatgctgttt gtcttggtct tagtgtttgc 



<210> 27 

<211> 29 

<212> DNA 
<213 > 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 27 

ggatgatgaa attgtagarc cacatgggc 



29 



<210> 23 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
< 2 2 0 > 

<223> Description of Artificial Sequence: Primer 

<400> 28 

tgtggagaag tcoctcaaag tgtgg 



66 
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20 
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< 2 1 1 > 2 9 

< 2 1 2 > 2 *-7A 

< 2 2 2 > 

<222 > DescriDtic" cf Artificial Secruence : Primer 



< 4 C 0 > 2 5 

10 tagtaggagt gacagcctga ctcggaacg 



< 2 1 0 > 3 0 
<211> 3 0 
15 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<4C0> 30 

aacgtagatg actcaggacg aaccatttcc 30 



25 <2io> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
30 <220> 

<223> Description of Artificial Sequence: Primer 



35 



<4C0> 31 

tcgtaccagg ggaggc t cag gc 



67 
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10 
15 

20 

3. 

25 

30 4. 

35 5. 



isolated polynucleotide comprising a nucleotide sequence selected from the grcup 
sisting of: 

a) a nucleotide sequence encoding the !GS4 polypeptide according to SEQ ID NO: 
2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8. 



contained in the deposit no. CBS1 02221 or the deposit no. CBS1 02222 at the 
Centraalbureau voor Schimmelcultures at Baarn the Netherlands, in particular a 
nucleotide sequence corresponding to the SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 5 or SEQ ID NO: 7. 

c) a nucleotide sequence having at least 80 % (preferably at least 90%) sequence 
identity over its entire length to the nucleotide sequence of (a) or (b); 

d) a nucleotide sequence which is complimentary to the nucleotide sequence of (a) 
or (b) or (c). 

The polynucleotide of claim 1 wherein said polynucleotide comprises the nucleotide 
sequence contained in SEQ ID NO: 1 encoding the IGS4 polypeptide of SEQ ID NO: 2 
or the nucleotide sequence contained in SEQ ID NO: 3 encoding the 1GS4 polypeptide 
of SEQ ID NO: 4 or the nucleotide sequence contained in SEQ ID NO: 5 encoding the 
IGS4 polypeptide of SEQ ID NO: 6 or the nucleotide sequence contained in SEQ ID NO: 
7 encoding the IGS4 polypeptide of SEQ ID NO: 8. 

The polynucleotide of claim 1 wherein said polynucleotide comprises a nucleotide 
sequence that is at least 80% identical to that of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 5 or SEQ ID NO: 7 or to the sequence of the DNA insert contained in the deposit 
no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor 
Schimmelcultures at Baarn the Netherlands over its entire length. 

The polynucleotide of claim 3 which is the polynucleotide of SEQ ID NO: 1 , SEQ ID NO: 
3, SEQ ID NO: 5 or SEQ ID NO: 7 or the DNA insert contained in the deposit no. 
CBS102221 or the deposit no. CBS102222 at the Centraalbureau voor 
Schimmelcultures at Baarn the Netherlands. 

The polynucleotide of claim 1-4 which is DNA or RNA. 



b) 



a nucleotide sequence encoding the polypeptide encoded by the DNA insert 
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A DNA cr RNA molecule comprising an expression system, wnerein said expression 
system is capable of producing an IGS^ polypeptide comprising an amino ecic 
sequence, which has at least 80% identity with the polypeptide of SEQ ID NO: 2, SEQ 
ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 or with the polypeptide encoded by the DNA 
insert contained in the deposit no. CBS102221 or the deposit no. CBS102222 at the 
Centraaibureau voor Schimmelcultures at Baam the Netherlands, when said expression 
system is present in a compatible host ceil. 



A host cell comprising the expression system of claim 6. 
A host cell according to claim 7 which is a yeast cell 
A host cell according to claim 7 which is an animal cell 



IGS4 receptor membrane preparation derived from a ceil according to claim 7-9. 

A process for producing an IGS4 polypeptide comprising culturing a host of claim 7 
under conditions sufficient for the production of said polypeptide and recovering the 
polypeptide from the culture. 

A process for producing a cell which produces an IGS4 polypeptide comprising 
transforming or transfecting a host cell with the expression system of claim 6 such that 
the host ceil, under appropriate culture conditions, produces an IGS4 polypeptide. 

An IGS4 polypeptide comprising an amino acid sequence which is at least 80% identical 
to the ammo acid sequence of SEQ ID NO: 2, SEQ NO: 4, SEQ NO: 6 or SEQ NO: 8 or 
to the polypeptide encoded by the DNA insert contained in the deposit no. CBS1 02221 
or the deposit no. CBS1 02222 at the Centraaibureau voor Schimmelcultures at Baarn 
the Netherlands over its entire length. 

The polypeptide of claim 13 which comprises the amino acid sequence of SEQ ID NO: 
2. SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the amino acid sequence encoded by the 
DNA insert contained in the deposit no. CBS102221 or the deposit no. CBS102222 at 
the Centraaibureau voor Schimmelcultures at Baarn the Netherlands. 



An antibcdy imnuncscecific for the IGS4 polypeptide of claim 13. 
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A method for the treatment of a subject in need of enhanced activity or expression of 
IGS4 polypeptide of claim 13 comprising: 

(a) administering to the subject a therapeutically effective amount of an agonist to 
said receptor; and/or 

(b) providing to the subject an isolated polynucleotide comprising a nucleotide 
sequence that has at least 80% identity to a nucleotide sequence encoding the IGS4 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the 
polypeptide encoded by the DNA insert contained in the deposit no. CBS102221 or the 
deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
Netherlands over its entire length; or a nucleotide sequence complementary to one of 
said nucleotide sequences in a form so as to effect production of said receptor activity in 
vivo. 

A method for the treatment of a subject having need to inhibit activity or expression of 
IGS4 polypeptide of claim 13 comprising: 

(a) administering to the subject a therapeutically effective amount of an antagonist 
to said receptor; and/or 

(b) administering to the subject a nucleic acid molecule that inhibits the expression 
of the nucleotide sequence encoding said receptor; and/or 

(c) administering to the subject a therapeutically effective amount of a polypeptide 
that competes with said receptor for its ligand. 

A process for diagnosing a disease or a susceptibility to a disease in a subject related to 
expression or activity of the IGS4 polypeptide of claim 13 in a subject comprising: 

(a) determining the presence or absence of a mutation in the nucleotide sequence 
encoding said IGS4 polypeptide in the genome of said subject; and/or 

(b) analyzing for the presence or amount of the IGS4 polypeptide expression in a 
sample derived from said subject. 

A method for identifying agonists to the IGS4 polypeptide of claim 13 comprising: 

(a) contacting a cell which produces a IGS4 polypeptide with a test compound; and 

(b) determining whether the test compound effects a signal generated by activation 
of the IGS4 polypeptide. 

An agonist identified by the method of claim 19. 
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The method for identifying antagonists to the IGS4 polypeptide of claim 13 comprising: 

(a) contacting a cell which produces a IGS4 polypeptide with an agonist; and 

(b) determining whether the signal generated by said agonist is diminished in the 
presence of a candidate compound. 

An antagonist identified by the method of claim 21 . 

A recombinant host cell produced by a method of claim 12 or a membrane thereof 
expressing an IGS4 polypeptide. 

A method of creating a genetically modified non-human animal comprising the steps of 

(a) ligating the coding portion of a nucleic acid molecule, consisting essentially of 
a nucleic acid sequence encoding a protein having the amino acid sequence SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the amino acid sequence encoded 
by the DNA insert contained in the deposit no. CBS1 02221 or the deposit no. 
CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the Netherlands or a 
biologically active portion of one of said sequences, to a regulatory sequence which is 
capable of driving high level gene expression or expression in a cell type in which the 
gene is not normally expressed in said animal; or 

(b) isolation and engineering the coding portion of a nucleic acid molecule, 
consisting essentially of a nucleic acid sequence encoding a protein having the amino 
acid sequence SEQ ID NO: 2, SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the amino 
acid sequence encoded by the DNA insert contained in the deposit no. CBS1 02221 or 
the deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
Netherlands or a biologically active portion of one of said sequences, and reintroducing 
said sequence in the genome of said animal in such a way that the endogenous gene 
alleles, encoding a protein having the amino acid sequence SEQ ID NO: 2, SEQ ID NO: 
4, SEQ NO: 6 or SEQ NO: 8 or the amino acid sequence encoded by the DNA insert 
contained in the deposit no. CBS1 02221 or the deposit no. CBS1 02222 at the 
Centraalbureau voor Schimmelcultures at Baarn the Netherlands or a biologically active 
portion of one of said sequences, are fully or partially inactivated. 
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